FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 

AGCAGGT C TGAAGACTAACAT T T TGT GAAGT T GTAAAACAGAAAACCT GT TAGAAATGT GGT GGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 

TCATACAT TACT GCAGTAACACT CCACCATATAGAC CCGGC TT TACC T TATATCAGTGACAC TGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 

C T ACCAT TTAT GTTCGT TATAAGCAAGT TCATGCTCTGAGT CCT GAAGAGAACGT TATCAT CAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 

TATATATGT TT GTTCAGACCAT CCT T TCC TACCAAAT GCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
AT CAGT T T T GCACAGTGGCAAT T T TGGGACTGAT TT AGAACAGAAACTCCAT TGGAACCCCGAGG 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 

GAGAT AT T TGAT GAARGGATAA&ATAT T TC TG TAAT GATT AT GAT T CTCAGGGAT T GGGGAAAGG 
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 

GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATCAT 
CAAGAAGAC TATT AAAAACACCTAT GCCTATAC T T T T T TAT C TCAGAAAATAAAGT CAAAAGACT 
ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8,39, NX(S/T): 0 

MWW FQQGL S FLP SALVI W T SAAF I FS Y I TAVT LHH I DPALP Y I S DTGTVAPEKCL FGAMLN I AAV 
LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSS.VLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 

LLSRDI 

Important features : 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 
N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGC CATGA GTT TCCT CAT CGACTCCAGCATCATGAT TACCTCCCAGATACTATT TTTT G 
GATT T GGGTGGCT TTTCTTCATGCGCCAATT GTTTAAAGACTATGAGATACGTCAGTATGTTG TACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CAT T CT CAGCCCAAAACAT GGGAT CTTAT CCAT AGAACAGCTCATCAGCCGGGTTGGTGTGATT GGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
O CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGT T CCAGAAGGGGGAAGTGCATAACA 

y AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 

ii ^ j 

CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 

^ g 

Ifl ATATGCTACCAAGGAGAGAATAGAAT^ 

gi ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
Hi GGGAAAACGGATCCTGT CACAAGAGGCAT TGAGAT CACTGT GAATTAT CTGGGAATCCAATTTGATGT 

* GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 
JjJ TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
S| CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
S* j AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
£3 ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
111 GAGAAGCAAAT GGCACCT TGAACT TAAGCCTACTACAGACTGTTAGAGGCCAGTGGT TT CAAAATTTA 

GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
T GT AGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACT CTGGGGCAAGACATGT 
CTATGGT AGCT GAGCCAAACACGT AGGAT TT CCGTT TTAAGGTT CACATGGAAAAGGTTATAGCTT T G 
CCTT GAGAT TGACTCATTAAAATCAGAGACT GTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 4 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVT FSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAWCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSALSSILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 

AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 

AGAAC TC TCCAT CCGGAC TAGT TAT T GAGCAT CT GCC T CTCATAT CACCAGTGGCCAT CTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAAT TATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAAT TTCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 

T T GAAAGC TAGCT TT GAAACTT GCAGC TATGGCT GGGT TGGAGAT GGAT TCGTGGT CATC TC TAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 

GCCGACAGT TT GCAGCC TAT TGT TACAAC TCAT C TGATAC T TGGACT AAC T CGT GCAT T CCAGAA 
AT TAT CACCACCAAAGATCCCATAT T CAACACTCAAAC TGCAACACAAACAACAGAATT TATT GT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 

GAAAC T AGCACCATGT CTACAGAAAC TGAACCAT T TGT TGAAAATAAAGCAGCAT T CAAGAAT GA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGT C T T GGAT T T T GCTAT GT CAAAAGGT ATGTGAAGGCCT TCCC TT TTACAAACAAGAAT 
H ! CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
O TGAGGAATCAAAGAAAACTGATAAAAACCCAGAAGAGTCCAAGAGTCCAAGCAAAACTACCGTGC 
Q GAT GCCTGGAAGCTGAAGTT TAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTTTCT TT 

m CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
=f J CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 

GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
pi TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
SI CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
M AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 

GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
L GGCAGAAAAG TT TAGCCCCT GAAAGCCATGGAGAT TCT CATAACTT GAGACC TAATC TC TGTAAA 

y GC TAAAATAAAGAAATAGAACAAGGC TGAGGAT AC GACAGTACAC T GT CAGCAGGGACTGTAAAC 

HI ACAGACAGGGTCAAAGTGT TTTCTCTGAACACATTGAGTTGGAATCACTGTT TAGAACACACACA 

rj C T TAC T T TT T C T GGT CT CTACCACTGC TGATAT T T T CT CTAGGAAAT ATAC T T T TACAAGTAACA 

jpl AAAATAAAAACTCTTATAAATTTCTAT TTTTATCTGAGT TACAGT^AATGATTACTAAGGAAGAT T 

S ACT CAGTAAT T TGT T TAAAAAGTAATAAAAT T CAACAAACAT TTGCTGAATAGCTACT ATAT GT C 

W AAGTGC T GTGCAAGGTAT TACACT CTGTAAT T GAATAT TAT TCCT CAAAAAAT T GCACATAGTAG 

PJ AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 

T T T TAT T T T T GCTGAGAC TAAT CT TAT T CAT T T TC TCTAATAT GGCAACCAT TATAACC TT AAT T 
TAT TAT TAACAT AC CT AAGAAGTACAT TGT TACC T C TATATACCAAAGCACATT T TAAAAGTGCC 
AT TAACAAATGTATCACTAGCCCTCCT TT TTCCAACAAGAAGGGACTGAGAGATGCAGAAATATT 

TGTGACAAAAAATTAAAGCATTTAGAAAACTT 
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FIGURE 6 



MARCFSLVLLLTSI WTTRLLVQGSLRAEELS I QVSCRIMGI TLVSKKANQQLNFTEAKEACRLLG 
LSIiAGKDQVETALKAS FETCSYGWVGDGFWISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
S SDTWTNSC I PE 1 I TTKDPI FNTQTATQTTEFI VSDSTYSVASPYST I PAPTTTPPAPASTS I PR 
RKKLICVTEVFMETSTMSTETEPFVENJCAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCW^ 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 



amino acids 235-254 



N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 



Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 




Tyrosine kinase phosphorylation site. 

amino acids 79-88 



N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCAT CATCAGAAGT GAACCT GGCAAACT TACCT CCCAGC TATCACAATGAGACCAACACAGAC 
ACGAAGG T T GGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 
TGGACAAAT GGT CT T T TCAGAGACAGT T ATCACAT C T GTGGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
m GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
f I CCAGGAGCTGGAGGACC TGGAGAGGAGCC TGAC TGAAGAGAT GGCGCTGGGGGAGCCTGCGGC T G 

W CCGCCGCTGCACTGCTGGGAGGGGAAGAGATT TAGA TCTGGACCAGGCTGTGGGTAGATGTGCAA 

TAGAAATAGCrAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 



33A 



CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
Hj CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
8 GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
m TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 

TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGGATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
W TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 

AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACAC TG TCCCTC TT TGGCAGT TGCATTAG TAAC T T TGAAAGGTATAT GACT GAGCGTAGCA 
TACAGGTTAACCTGCAGAAACAGTACT TAGGTAATTGTAGGGCGAGGAT TATAAATGAAATTTGC 
AAAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GATTGTATAAGCATGCTT TCTT TGAGTTTTAAATTATGTATAAACATAAGTTGCAT TTAGAA 
ATC AAGCATAAAT CAC T T CAACT GCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 

MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEVNIANLPPSYHNETNTOT 

TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVW-GHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHS LVYVCKPT FVGSRDQDGE ILLPREVPDE YEVGS FMEE VRQELEDLE 
RSLTEEMALGE PAAAAAALLGGEE I 



Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 



N-myristoylation site. 

m . amino acids 202-208, 217-223 



Amidation site. 

amino acids 140-144 
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FIGURE Q 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 

GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGC.TCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 

AAGAT T T TAT T AAAGATAT T T TG T T AACTC 
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FIGURE 10 

RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQD I PT FPL I S AF I RTLRYHT GS LAFGAL I LTLVQ I ARV I L 
E YI DHKLRGVQNPVARC IMCC FKCCLWCLEKFI KFLNRNAY IMI AI YGKNFCVSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLS F.FFFSGRI PGLGKDFKSPHLNYYWLPIMTS ILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 27 6-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE ll 

GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTGTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTGCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 

AACAGCCT GAT GCAGACCGAGGAGT GCCCACCT AT GCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 

MGACLGACSLLS CASCLCGSAPC I LCS CCPASRNST VSRL I FT FFL FLGVLVS I IMLS PGVES QL 
YKLPWCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLmLCVSSSRDPRAAIQ 

NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLI I FLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
S FFH FCLVLAS LHVMMTLTNWYKPGE TRKM I S T WTAVWVK I CAS WAGLLL YLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE la 



CGGGCCAGCCTGGGGCGGCCGGC'CAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGACT CC TGTAACCCTCCT CCAGGATGAACCACCT GCCAGAAGACAT GGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACGCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGTGATGCAGTAT GACTACTATT CT TCATAT 
T T T GATATAT T TC T T C T GGCAGT T T T TCGAT T TAAAGTGT TAATACT TGCAT AT GCT GT GTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
ih CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 

TO* 1 

Q GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
y * GAGAAACCACT T T TAGAAC TATGAGTACTAC TT T T GT TAAATGT GAAAAACCCTCACAGAAAGTC 



ATCGAGGCAAAAAGAGGCAGGCAGT GGAGTC TCCC TGTCGACAGTAAAG TTGAAATGGTGACG TC 



CAC TGCTGGCT TTAT TGAACAGC TAATAAAGAT T TAT T TAT TGTAATACC T CACAAACGTT GTAC 

Z 3 

%J CATATGCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 



TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 



AAAAGACTTAATATATTGAAGTAACACTT TT TTAGTAAGCAAGATACCT TTTTAT TTCAATTCAC 

sat 



AGAATGGAATT TT TT TGTTTCATGTCTCAGATTTATTTTGTAT TT CT TT TTTAACACTC TACAT T 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 



pi ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 



m CTT GCAAAAGGGGAAGAAAGGAATT GCGAATACAT GTAAAAT GT CACCAGACATT TGTAT TATT T 

TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGAC TTAAACCAT TCATAT CATG T T T CC T T TGCGT T CAGC CAAT T TCAAT TAAAAT GAA 
C TAAAT TAAAAA 
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FIGURE 14 



MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF' 
VTLLWI I ELNVNGGI ENTLEKEVMQYDYYS S YFD I FLLAVFRFKVLI LAYAVCRLRHWWAI ALT T 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFIIAWIE 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 




amino acids 54-72, 100-118, 130-144, 146-166 



N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 




APPJD=10063587 



Page 164 of 3 18 



FIGURE is 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTGAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 

AGGAGAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 

CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 

GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 

TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 

CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 

TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGGCCTGCCTGGCGACCTCTCGGGCC 

TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 

TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 

CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGGCGACTTTGGCTGCCCAG 

CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 

TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 

CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 

ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 

GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 

GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 

AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 

GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 

GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 

GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 

CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 

ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 

CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 

ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 

GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 

ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 

GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 

CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 

CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 

TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 

GGCT GTGTGAC TC T AGT CT TGGCC CCAGGAAGCGAAGGAACAAAAGAAAC TGGAAAGGAAGATGC 
TT TAGGAACAT GTTTTGCTTTT T T AAAATATAT AT AT T TATAAGAGATCCT T TCCCAT T TAT TC T 
GGGAAGATGTT TT T CAAACT CAGAGACAAGGAC T T TGGT TT T TGTAAGACAAACGATGATATGAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 
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MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLIAAA^ 

TRCHFPPPCNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVT QLRPNAT YS VCVMPLGPGRVPEGEEACGEAHT PPAVHSNHAPVT QAREGNLPLL I APALAAV 
LLAAIAAVGAAYCTORGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQS PLHAKPY I 

Important features: 
Signal peptide : 



S 

i'J amino acids 1-23 



Transmembrane domain; 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCTTAGAT TCCAAGACTACTT TGACATCAGATGAGTCAGTA 

AAGGACC AT ACT AC T GCAGGCAGAGT AGT TGCT GGT CAAAT AT T T C T T GAT T CAGAAGAAT C T GA 
ATTAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATATCAGC T T TCTAGAGTCT CCAAATCCAGAAAACAAGGACT AT GAAGAGCCAAAGAAAGT A 
CGGAAACCAGCTT TGACCGCCAT T GAAGGCACAGCACATGGGGAGCCC T GCCACT T CCC T T T T CT 
T T T CCTAGATAAGGAG TATGAT GAAT G TACAT CAGATGGGAGGGAAGATGGCAGACTGT GGT GTG 
CT ACAACC TATGACTACAAAGCAGATGAAAAGT GGGGC T T T T GT GAAAC TGAAGAAGAGGCTGCT 
AAGAGACGGCAGATGCAGGAAGC AGAAATGAT GT AT CAAAC T GGAAT GAAAAT CCT TAAT GGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 

CCAAAGCCCTGGAGAGAGTGTCATATGCTCT TT TATT TGGTGAT TACTTGCCACAGAATATCCAG 
GCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGCTCTTGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 

T T GGAGC TC T T GGGGGCAATCT AATAGCCC ACAT GG TT T TGGT AAGTAGACT TTAGT GGAAGGC T 
AAT AAT AT T AAC AT CAGAAGAAT T T GTGGT T TAT AGCGGCCACAAC T T T T TCAGCT TT CAT GATC 
CAGAT T TGC T TGTAT T AAGACCAAATAT TCAG T TGAAC T TCC TT CAAAT TCT T GT TAATGGATAT 
AACACATGGAATC TACATGTAAAT GAAAGT T GGT GGAGT CCACAAT T T T TCT T T AAAATGAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 

GAATTGTCAGAATCAT TTTTTACATTAGATTATCATAAT TT TAAAAATTT TTCT TTAGT TT TTCA 
AAATTT TGTAAATGGTGGCTATAGAAAAACAACATGAAATAT TATACAATATTTTGCAACAATGC 
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAAC T CAT T T T T AATAAAAT TATGTC TAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 

MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAA^ 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALVY YT FGALGGNLI AHMVLVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, ■ 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 



APP ID=1 0063587 



Page 168 of 318 



FIGURE iq 



ft 



AAT T C AGAT T T T AAGCCCAT T C T GC AG TGGAAT T T CAT GAAC T AGCAAGAGGACACC AT C T T C T T 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 
AAAAAGGGATT TCAT GTAATCGCTGCC TGTC TGACTGAATCAGGATCAACAGC TT TAAAGGCAGA 
AACCT CAGAGAGACT TCGTACTGTGCT TC TGGATG TGACCGACCCAGAGAATGTCAAGAGGAC TG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCG TGC T GGCTCCCACT GACT GGC TGACACTAGAGGACTACAGAGAACCTATT GAAGTGAA 
CCT GT TTGGAC TCAT CAGTGT GACACTAAATAT GCTTCC TT TGGT CAAGAAAGCT CAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TAT GCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTT TGGT GTGCACGTCTC 



fK ATGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 
m TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 

AGTCT AGACAAACTGAAAGGCAATAAATCC TAT GT GAACATGGACCTCTCTCCGG TGG TAGAG TG 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
T T T TC TGGATACC TC TGTC TCACAT GCCAGCAGCT T TGCAAGACT T T TTAT TGTT GAAACAGAAA 
GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
AT CAT C T C T TAT C T AAAT AT T AAAAG AT AAG T C AAC CC AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 



p. N 

Ui 
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FIGURE 20 

MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKZVETSERLRTVLLDVTDPENWRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
REPIEVNLFGLISVTL^LPLVKKAQGRVI3WSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSTONMD 
LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of* the protein : 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGC ATGG AGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACGTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGT GAAGCCAAGAACAGCAT TACT GAT TCCCAAAT GGAT GATGT T GAAGT TGT TT AT ACAATT GA 
CAT TCAGAAAT AT AT TCCATGCTAT CAGCT TT TTAGCT T T TATAAT T CT TCAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGTCATTCAGATCAGATCATGACGT T TAGAGAGAGGC TGC T TCACAAAAACT TGCAGGAGCAT T T 
T TCAAACCAAGACCT TGT T T T TC TGCTAT TAACACCAAG TATAATAACAGAAAGCTGCTCTAC T C 
ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
GCCTVATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TTTTAGCCGAGCAGTACAAACACACAGCTCTAAATTTTTTGAAGAAGATGGATCCTTAAAGGAGG 
T ACATAAGATAAAT GAAATGTATGCTTCAT TACAAGAGGAAT TAAAGAGTATATGCAAAAAAGTG 
GAAGACAGTGAACAAGCAGTAGATAAACTAGT AAAGGAT GTAAACAGAT TAAAACGAGAAAT T GA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAATCTGACCTTAATGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCAC 
AAATCAT TAAGCATAAAGCCT TAGACT TAGATGACAGATGGCAAT TCAAGAGAT CT CGGT TGT TA 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
¥ 1 GAGCAGCCCAGAAACAGAT GAAGAAAT TGAAAAGAT GAAGGGT T T T GGTGAATAT TCACGG TC T C 

CTACAT TT TGATCCT TT TAACCT TACAAGGAGATT TTTT TATT TGGCTGATGGGTAAAGCCAAAC 



3^ 



•-IT V,-^ 



in 



SIS 5; 



ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
T GT TTGCAGTAATACACAGATAACT CT TAGTGCATTTACTTCACAAAGTACTTTTTCAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGATTGAT TGAAGCTAGGAGT TAGAGACCAGCCTGGGCAACGTAT T 
GAGAC CATGTC TATTAAAAAAT AAAAT GGAAAAGCAAGAATAGCCT TAT T T TCAAAATATGGAAA 
G AAAT T TAT AT GAAAAT T TAT C T GAG T CAT T AAAAT T C T C C T T AAG T GAT AC T T T T T T AG AAG T A 
CAT TATGGC TAGAGT TGCCAGATAAAATGCT GGAT ATCATGCAATAAAT T TGCAAAACATCATC T 
AAAAT TTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEWEQALKKILSmTKKOTVGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHS SKFFEEDGS LKEVHKINEMYASLQEELKS I CKKVEDSEQAVDKLVKDVNRLKRE IEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MYEHTDI PEAS PASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSS PET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 




N-glycosylation sites. 

amino acids 75-79, 322-32 6 



N-myristoylation site. 



amino acids 184-154 




Growth factor and cytokines receptors family. 



amino acids 134-150 



APP ID-1 0063587 



Page 172 of 31 8 



FIGURE 23 



M 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGGCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 

AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
CAAGAGATGGAGGAGAAGTACAGCCTGCAAAGTAAAGATGAT^ 

CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
AAT GATAT GTTTGT GAAGACATATGGCCATTTATACAT GCAAAATT CTGAGCTAT TT AAAGAT CT CT TCGTAGAGTT G 
AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 
TTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGTATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCGCAGTGTACCCATGCCCTGTTGAAG 
AT GAT CTACTGCT C CCACT GC CG GGGT CTCGTGACT GT GAAGCCAT GTTACAACTACT GCTCAAACATCAT GAGAGGC 
T GTTT GGCCAACCAAGGGGAT CT CGATTTT GAAT GGAACAATTTCATAGAT GCTATGCTGATGGTG GCAGAGAGGCTA 
GAGGGTC CTTTCAACAT TGAAT C GGT CAT GGAT CC CATCGAT GTGAAGATT TCT GAT GCT ATTAT GAACAT GCAG GAT 
AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
AGTTT GGAC CGACT GGT TACT GATGT CAAGGAGAAACTGAAACAGGCCAAGAAATTCT GGTCCTC C CTT C C GAGCAAC 
GT TT GCAAC GAT GAGAGGATGGCTG CAGGAAAC GGCAAT GAGGAT GACT GT TGGAATGGGAAAGGCAAAAGCAGGTAC 
CT GTT TGCAGTGAC AGGAAAT GGATTAGCCAACCAGGGCAACAACC CAGAGGT C CAGGT T GACAC CAGCAAACCAGAC 
ATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGATGAAGAATGCATACAATGGGAACGAGGTG 
GACTT C TTT GATATCAGTGATGAAAGTAGTGGAGAAGGAAGTGGAAGTGGCTGT GAGTATCAGCAGT GC CCTT CAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGCCTAC CTCCT CACT GT CT T CT GCAT CT T GTT CCTG GTTATGCAGAGAGAGT GGAGATAATT CT CAAACT CT GAG 
AAAAAGTGTT CAT CAAAAAGTT AAAAGGCAC CAGTTAT CACTT TT CTAC CATCCT AGT GACTTTGCTTTTTAAATGAA 
TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAG GGACT GTGCATTGAGTTGGTT C CT G CT CC CCCAAACCAT GTT AAAC GT GGCT AACAGT GTAGGT ACAGAA 
CT ATAGTTAGTTGT GCATTT GT GATT TTAT CACT CTATTATTT GTTT GT AT GT TT TTTT CTCATT T CGT TT GT GG GTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CTGAAATATTAAATAGCTGTACAGAAGCAGGTTT TATTTAT CAT GTTAT CTTAT TAAAAGAAAAAGCC CAAAAAGC 
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FIGURE 24. 

MARFGLPALLCTIA^ 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSWNPTAQCTHALLKMIYCSHCRGL 
VTVKPG YNYC S NIMRGCLANQGDLDFEWNN FI DAMLMVAERLEGP FNI ESVMDP I DVKI S DAIMN 
MQDNS VQVSQKVFQGCGPPKPLPAGRI SRS I SESAFSARFRPHHPEERPTTAAGT SLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 

KPD I L I LRQ IMALRYMT SKMKNAYNGNDVD FFD ISDESSGEGSGS GCE YQQC PSE FDYNAT DHAG 
KS ANEKADS AGVRPGAQAYLLT VFC I L FLVMQRE WR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites . 

amino acids 494-498, 498-502- 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins . 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



si 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCT TC TAGGAGATGGCT CCAGGAAGGCGGCCAAGAATGT GAGT GCAAAGAT T GGTTCC TGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
T CAAGGGCAATGT GAAGAAAAGAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCAT TC CAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
T TC T TC TCCCACC TC AC TC TCCCACT GTACCC ACCCCT AAAT CAT T CCAGT GCTCT CAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 



^ AGCT AGTGTCAT T TAACCT TAAAT GCAAT CAGGAAAG TAGCAAACAGAAGTCAATAAATATTT TT 



5 



AAAT GT CAAAAAAAAAAAAAAAAAA 



IS** 

•» — ■* 

1 fr* 



APP ID=1 0063587 



Page 175 of 318 



FIGURE 26 



h si 

us 



If! 

1; 3 



r v- 



MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 



Important features: 
Signal peptide: 

amino acids 1-22 



N-myristoylation sites. 

amino acids 27-33, 46-52 



* 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
T GGCCGTGC TGACT CCCAAGAGCAACCGCAAGAT GGAGAGCAAGAAACGGGAGCTCTT CAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGC CCACC TGGAAACACATT GGGGATGGCT GCT GCCTCACCAGAGAGACC TGGAAGGAT CT T GA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
AAT CTCTAACT T CAATCCCGCCT T CGACAGT GAAAAAGC TC TACTT CTACGC T GACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GCCAGAGCAAGACTCAAAGAGGCAGAGGT TT TGTTCTCAAATAT TT T TTAATAAATAGACGAA 
ACCACG 
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FIGURE 28 

MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCS VQSPRKVLQEVRRVLRPGGVLFEWEHVAE PYGSWAFMWQQVFE PTW 
KH I GDGCCLTRETWKDLENAQFSE I QMERQP PPLKWLPVGPHIMGKAVKQS FPS SKALI C S FPS L 
QLEQATHQP I YLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-ntyristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTTTATGATTCTTATACTAATTTATACAAAGATATTAAGGCCCTGTTCATTAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 

AGTTAAAAGCTGAAAAAARAAAAAftAAAAAAftAAAAAAA 
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FIGURE so 



MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 



Growth factor and cytokines receptors family 

amino acids 3-18 
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FIGURE 31 



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 

AT T T GCATCTGTT T T GATAAAT GAT GT TGACACCCT CCACCGAATT CT AAGTGGAAT CATGTCGG 
GAAGAGATACAATCCT TGGCCT GTGTATCC TCGCAT TAGCC T T GT C T T TGGCCAT GAT GT TTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 

CAGAAAGGGAAAATATGAAGTGCGT GC TGGGGT T TGCTATCGTAT CCACAGGCAT CACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GATTCCGAGAATCAT TGTCATGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCAT TGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 

AACCAGAATGCATATAC TACAAC TGCTATTAATGGGACAGAT T TCTGTACATCAGCAAAAGATGC 
ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 

AAAAGGAGCAACAAAT TAAACAAT GCAAGGGCACAGCAGGACAAGCAC T CAT TAAGGAATGAGGA 
GGGAACAGAACTCCAGGCCATTGT GAGATAGA.TACCCAT TTAGGTATC TGTACCTGGAAAACATT 
T CCT T C TAAGAGCCAT T TACAGAATAGAAGAT GAGACC AC T AGAGAAAAGT T AGT GAAT T T T TT T 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 

MSGRDTILGLCIIALALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGVLWW 
LDTERENMKCVLGFAI VS TGI TAVLLVL I FVLRKRIKLT VELFQ I TNKAI S SAP FLLFQPLWT FA 

I LI FF5JVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWS YHLI GLIWTSE FILACQQMT IA 

GAWTCYFNRSKNDPPDHP I LS S LS I LFFYHQGTWKGS FL I S WRI PRI I VMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 

FIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 

VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 



Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 



**' acids 147-153, 178-184, 196-202, 296-275, 342-348 



*■*» amino 
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FIGURE 33 

GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 

ATAAT T TGTAT GGGAT T T GTGAT GCAGGAAAGC CT AAGGGAAAAAGAATAT TCAT TCT G TGT GGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT T CAAACAAAGAAACGGCAAAGAAGAT TAAAAGGCCCAAGT T CACT GTGCC TCAGAT CAAC TGC 
GAT GT CAAAGCC GGAAAGATCAT CGATCCT GAG T TCATTGTGAAAT GT CCAGCAGGAT GCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATCG TC T TAGAAAG TAAACCCAAAAAGGGTGTAACCTACCCATCAGC TCT TACATAC TCAT CAT 
CGAAAAGTCCAGC T GCCCAAGCAGGTGAGACCACAAAAGCC TATCAGAGGCCACCT AT T CCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 

GCCACAGGAGCCAGGAGATGGATCT CTGGT CCACT GCCAC C TACACAAGCAGCCAAAACAGGCCC 
AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 

GGAT GTCAGCC T GGGAC T TGT T CCAAAAGAAGAAT T GAGCACACAGT C T TTGGAGCCAGTATCCC 
f*| • TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 

SI CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
H CGGTCCACT GAT GGGTGTTGTCCAGTATGGAGACAACCC TGC TACT CACTTTAACCTCAAGACAC 

P * ACACGAAT TCTCGAGATCTGAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGACTTTCTAAT 

yj GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
m GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 

K GACT TGCGAGAGAGT CAGGAAT CAACATT TTCT T CAT CACCAT TGAAGGTGC T GC TGAAAAT GAG 

ft AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 

^ GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 

s? ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 

Q GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 

m ' AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 

ml T GGAGT T TGGGT T CGACAAGTACAGCAGCAAGCC T GACAT CC TCAACGCCATCAAGAGGGTGGGC 

H TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 

W GT CCAAGCCCAACAAGAGGAAGT TAATGATCCT CAT CACCGACGGGAGGTCC TACGACGACGTCC 

CI GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 

m GCCCAAGAGGAGCT AGAAGT CAT T GCCACT CACCCCGCCAGAGACCAC TC CT T CT T T GT GGACGA 

GTT TGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACATT TGTACAGAGT TCAACTCAC 

AGCC TCGGAAC TGAAT T CAGAGCAGGCAGAGCACCAGCAAGT GCT GCT T TAC TAAC T GACGT G T T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 

AAATG T C TT GT TAT TAT TCT T TGCCAT CATGC T T T T TCATATT CCAAAAC T TGGAGT TACAAAGA 
T GATCACAAACGT AT AGAAT GAGCCAAAAGGCT ACATCATG T T GAGGGT GCT GGAGATT T T ACAT 
T T T GACAAT T GT T T TCAAAATAAAT GT TCGGAATACAGT GCAGCCC T TAC GACAGGCTTACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 

T T T T GTCATGACAATGTAGGAAT T GCTGAAT T AAATGT T TAGAAGGAT GAAAAATAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAZVAAAZU^^AAAAA 
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FIGURE 34 



MRTVVLTMKASVIEMFLVLLVT 

CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 

ES FI VLE SKPKKGVT YPSALT YS SSKS PAAQAGETTKAYQRPP I PGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 

RSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENEKQYVVE 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMILI TDGRSYDDVRI PAMAAHLKGVI TYAIGVAWAAQEELEVIATHPARDHS FF 
VDE FDNLHQYVPRI I QN I CTE FNS QPRN 

Important features : 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29/ 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Ami da t ion site. 

amino acids 304-308 
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FIGURE 35 



CC GAGCAC AGGAGAT T GCCT GCGT TT AGGAGGTGGCTGCGTT GT GGGAAAAGCT AT C AAGGAAGAAAT TGC 
C AAACC ATGT CT T TTT T T CT GT TT T C AG AGT AGTT CAC AAC AGAT CTGAGT GTT TT AAT T AAGC AT GG AAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 

GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 

T GGT GG GGAT ATGAGGTT CTTACATTTT TCTT AT TAGGCC AAGAGGCTGAAAAGGAAG AC AAAATGTT GGC 
ATTGTCCTTAGAGGATGAACACCTTCTTTATGGTGACATAATCCGACAAGATTTTTTAGACACATATAATA 

ACCT GACCT T G AAAACCAT TAT GGC AT T CAGGTGGGT AAC TGAGTT TT GCCCCAATGCCAAGTACGT AAT G 
AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA 

GAAGT T TT TCAC AGGT TATCCT CT AATT GAT AAT TATTCCTAT AGAGG ATTTTACC AAAAAACCCAT ATTT 

CTT ACCAGGAGT AT CCT T TCAAGGT GTT CCCTCC AT ACTGCAGTGGGT T GGGTT AT AT AAT GT C C AGAGAT 

TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 

CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 

ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 

TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG 

GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 

AC ACT GAACT GAAACT CAT G AAAAAC CC AGAC TGGAGACT GGAGGGTT AC ACTT GTGATT T ATT AGT C AG G 

CC CTTCAAAGATGAT AT GT GGAGGAATT AAAT AT AAAGGAATT GGAGGTTTT TGCT AAAGAAAT T AAT AGG 

ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 

CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 

TATCTT AT GT GGAT TACCAATTTAAAAATAT ATGT AGT TCTGT GTC AAAAAACT T CTT CACT GAAGT TAT A 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 

AT TAT T T AAAAT T ACT TCAACT TTGT GT TTTT AAATGT TT T G ACGATTT C AAT AC AAG AT AAAAAGGAT AG 
TGAATCATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCACTCCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 

AATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKM^ 

L S LE DEHLL YGD 1 1 RQD FLDT YNNLT LKT IMAFRWVTE FC PNAKYVMKTDTDVFI NT GNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 

KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 




N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 



Glycosaminoglycan attachment site. 

amino acids 239-243 



Ly-6 / u-PAR domain proteins. 

amino acids 23-37 



N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 

CGCTC GGGC ACCAGCCGCGGC AAGGMGGAGC TGGGT TGC TGGACGCAGTTGGGGC TCACT TT T CT TCAGCTCCT TC TCATC 
TCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGCCCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTG 
AATATGAT C AGATTGAGTGCGTC TGCCC CGGAAAGAGGGAAGT CGTGGGT TATACCATC CC TTGCTGCAGGAATGAGGAGAA 
TGAGT GT GACTCCTGCC TGAT CCACCCAGGT TGTACCATC TTTGAAAAC TGCAAGAGCTGCCGAAATGGCT C ATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGC 
TAAACCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCAGTATGACTATGTTGAG 

GT TCGTGAT GGAGACAACC GCGAT GGCC AGATCATCAAGCGTGTC TGTGGCA&CGAGCGGCCAGCTCC TATCCAGAGCAT AG 

GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 

ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 

TATACTGGGCAGCGCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCAGACCCTGGGGGCCCAGTCAATGGGTACCAGA 

TAAC AGGGGGCCC TGGGCT TATC AACGGACGCCATGCTAAAAT TGGC ACCGTGGTGTCT T TCT T TTGTAAC AACTCCTAT GT 

TC T TAGTGGCAATGAGAAAAGAACT TGC CAGCAGAATGGAGAGT GGTCAGGGAAAC AGCCCATC TGCATAAAAGCC TGC C GA 

GAAC CAAAGATT TCAGACC TGGTGAGAAGGAGAGTTCTTCCGATGCAGGT TCAGTCAAGGGAGACACCATTACACC AGCT AT 

ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 

ATACCAACATCTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAGGAGGACATGT 

CTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCGGGAAAATTGAGAACATCACTGCTCCA^ 

CCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCCATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGC 

GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 

GTCACCATGATCAAGAC AGCAGAC C TGAAAGT T GTT T TGGGGAAATTCTACC GGGATGAT GACC GGGAT GAGAAGACCAT CC 
AGAGCCTACAGATTTCTGCTATCATTCTGCATCCCAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCT 
CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
TCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGG 
TGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGCAGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTT 
CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATAAAACATGCAGCCACAGGCTCTCCA 

C TGCCTTCAC CAAGGTGCTGCCTTT TAAAGAC TGGAT TGAAAGAAATATGAAATGAACC ATGC T CATGCACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 

GTGC CAGGGCT TCTGACTTCAGGGACAAAACTCAGTGAAGGGT GAGTAGACC TCCAT TGC TGGTAGGCT GATGCCGCGT C CA 
CTACTAGGACAGCCAA.TTGGAAGATGCCAGGGCTTGCAAGAAGTAAGTTTCTTCAAAGAAGACCATATACAAAACCTCTCCA 

CTCCACTGACCTGGTGGTCTTCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 

GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 

TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 

GAACTACAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 

ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKI TGGPGL INGRHAKI GTWS FFCNNS YVLS GNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KT QGLRWP WQAAI YRRTSGVHDGSLHKGAW FLVC S GALVNERTVWAAHCVTDLGKVTM IKTADL 
KWLGKFYRDDDRDEKT IQSLQI SAI ILHPNYDPI LLDADIAILKLLDKARI STRVQP ICLAASR 
DLSTSFQESHI TVAGWNVLADVRS PGFKNDTLRSGWSWDSLLCEEQHEDHGI PVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 

ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 

GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTGAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 

TTT GGGCC CTT GTAGCTGACAGAAGGTGGCCAGGGAGAAT GCAGCACACTGCT CGGAGAATGPAGGCGCTT CT GTT GC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGTGCCTCCCACTACGGCCTGACGAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 
T CACAGCCACGGCTCC CT CCCCAGAG GTTT CTGCAGCT GCCACCAT CT CCT TAAT GACAGACGAGCCT GGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAG^ 

GGGCACGGCCCTTT GAGAGAT CCACT ATTAGAAGCAGAT CATTTAAAAAAATAAAT CGAGCTTT GAGT GTT CTT CGAA 
GGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGA 

TCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATCAAGATCAA.TCGAGTAGATCCCAGTGAAA 

GCCTCTCTAT T AGGCT GGT GGGAGGTAGCGAAAC CCCACTG GT CCATAT CATTATC CAACACATTTAT CGT GAT GGGG 
TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTC 

ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 

GCAG GAACAAT GGACAGGCCC CGGAT GC CTACAGACCCCGAGAT GACAGCTTT CAT GTGATTCT CAACAAAAGTAGCC 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCGATATGGCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGCCCTGAC^TCTTTCAGGAAGCCGGCTGGAACAGCAATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGC^ACA 

CTCC CAAGCCC CT C CAT CCTACAATTACT TGTCATGAGAAGGT GGTAAATAT CCAAAAAGACC C CGGT GAAT CT CT CG 
GCATGACCGTCGCAGGGGGAGCATCACAtAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAGCCCGGAGGAG 

TCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGATGGGGTCGAACTGACAGAGGTCAGCC 
GGAGTGAGGCAGT GGCATTATT GAAAAGAACATCAT C CT CGATAGTACT CAAAGCTT TGGAAGT CAAAGAGTAT GAGC 
CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGG 

GGGT CAT GT GGCT GGAATTAC CACGGT GCTT GTATAACTGTAAAGATATT GTATTACGAAGAAACACAGCT GGAAGT C 
TGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAACAAACCTTTTTTCATCAAATCCATTGTTGAAGGAA 

CACCAGCATACAAT GAT GGAAGAATTAGATGT GGT GATATT CTT CTT GCT GT CAAT GGTAGAAGTACAT CAGGAAT GA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 

T TT T ATAGAAT CAAT GAT GGGT C AGAGGAAAAC AGAAAAAT CACAAAT AGGCT AAGAAGT T GAAAC ACT AT ATT T AT C 
TT GT CAGTT TTT ATATT TAAA.GAAAGAATACATT GTAAAAAT GT CAGGAAAAGTAT GAT CATCT AAT GAAAGCCAGT T 
ACAC CT CAGAAAATAT GATT C CAAAAAAA.TTAAAACTACTAGTT TTTTTTCAGTGT GGAGGATTT CT CATT ACT CTAC 
AACATTGTTTATAT T TT TT CT ATT CAATAAAAAGC CCTAAAACAACTAAAAT GAT TGAT T T GTATACCCCACT GAAT T 
CAAGCTGATTTAAATTTAAAATTTGGTATATGCTGAft-GTCTGCCAAGGGTACATTATGGCCATTTTTAATTTACAGCT 

AAAATATTTTTTAAAAT GCATT GCTGAGAAACGTT GCTT T CATCAAACAAGAATAAATATTT TT CAGAAGTT AAA 
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FIGURE do 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKS GSAVANHADQGRENSENTTAPEVFPRLYHL IPDGE ITS IKINRVDPSE SLS IRLV 
GG S E T P L VH 1 1 1 QH I YRDGVI ARDGRLL PGD 1 1 LKVNGMD I SNVPHN YAVRLLRQPCQVLWL T VM 
REQKFRSRNNGQAPDAYRPRDDS FHVI LNKS SPEEQLGIKLVRKVDEPGVF I FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
I KTGDI LLNVDGVELTEVSRSEAVALLKRTS S S I VLKALEVKEYE PQEDCS S PAALDSNHNMAPP 
SDWS PS WVMWLELPRCL YNCKD I VLRRNTAGSLG FC I VGGYEE YNGNKPFFIKS I VEGT PAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 
amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAAT CAGATTGGAAAAGCTCAACT TGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACMGGGCTTCAACCTGACT 

TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGT AACTACTT CGTGGGTGCCAT T CAAGAGAT TCCTAAAGCAAAGGAGTTCATGGCTAATT TCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 

GACAACT GTCCTT CTGT GTCT CCTTACCT CAGAGGCCAGAGCAAGCT CATTT TCAAACCAGAT CTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CATCCCAAGCATCTGGT GGT T GGCAGGAACAGCACTGGGTACAGGTTACGTT ACAGTGGATAT TT TGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 
TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGAT 
CAT GAGGGT TAAATATT GTAATATGGATACTTGAAGGACTT TATATAAAAGGATGACTCAAAGGATAA 
AAT GAACGCTATTTGAGGACT CTGGTT GAAGGAGATT TAT T TAAATT T GAAGTAATATATT AT GGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 

TTT TAAAGAGTTTTT GTAAAATGAT TT T GTACAAGT AGGAT ATGAAT TAGCAGTTTACAAGTT TACAT 
ATTAACTAATAATAAATATGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 42 



MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEMZ^TFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKM.QRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 

DLVPENDFNLYKCEEHPKHLWGRNS TGYRLRYS GYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 

SVEHNPLYINI TVDFWFGA 

Irrqportant features: 
Signal peptide : 

amino acids 1-27 




N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 



Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 



GCT CAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACT GAGC TCCCAGATC TGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGGGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TC CAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE AA 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQIAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CI FCCGCCHRSKCGMCCKT 



Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 45 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCT TCAACACAACCCC TCT T GTCACCATACAGCCAGAAGGGGGCACTAT CAT AGTGACCCAAA 
ATGGT AATAGGGAGAGAGTAGACTT CCCAGAT GGAGGCTAC TCCCT GAAGC TCAGCAAACTGAAG 
AAGAATGACT CAGGGAT CT AC TAT G T GGGGAT ATAC AGC TCAT CAC T C CAGCAGCCC T CCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
AT AAGAAT GGCACCT GTGTGACCAAT C TGACATGCT GCATGGAACATGGGGAAGAGGATGTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GT T T CTGAAGAGAGAGAGACAAGAAGAGTACAT TGAAGAGAAGAAGAGAGTGGACAT T T GT CGGG 
AAACT CC TAACAT AT GCCCCCAT T CT GGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 
AGAACAATCCTT^AAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGAT 
GGAAAAT CCCCACT CACTGC TCACGATGCCAGACACACCAAGGC TATT TGCC TAT GAGAATG T TA 
T C TAGACAGCAG TGCACTCCCCT AAGTC T C TGCT CA 



APP ID-10063587 



Page 195 of 318 



FIGURE 46 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWT FNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVD ICRET PNI CPHSGENTE YDT I PHTNRT I LKEDPANT VYS TVE I PKKMENPHS LLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCT GATGGCCAT TCCAGCAACAACAATGTCCT T GACAGCAAGAAAAAGAGCGT G 
CTGCAACAACAGAAC T GGAAT GT TT CTT TCAT CAT T T T TCAGT GTGATCACAGT CAT T GGTGC TC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGACATTCATCCAGAATCCTTCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCACTTCGATTCTGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
TG TAGTT TAAT GGGAATAAAATGTAAGTATCAGT AGT T TGAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWTSCNGFSLLVLLLLGVVLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 

T TMS L TARKRACCNNRT GMFLS S FFSVI TVI GALYCML I S I QALLKGPLMCNS PSNSNANCE FSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 

LLVG'ILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cO-lP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 

AGGAGCAGTGAGGAACC TGCAGAGT CACACAGT T GCT GACCAAT TGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 

CAT T T GCCAATAAAGACGATCCC T T CTAC TAT GACT GGAAAAACC T GCAGC TGAGCGGACT GATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGT CCT G TACC TGAGAAGGCCATCCCACT CATCAC TCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTLALLLLAGLTALEANDPF 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites* 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE 51 

GT GGACTCTGAGAAGCCCAGGCAGT T GAGGACAGGAGAGAGAAGGC TGCAGAC CCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCC TGGT GGGAGGAAGACAC T C TGGAGAGAGAGGGGGCT GGGCAGAGATGAAGT T CCAG 

GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATAT TGGGGAGGCCC T TGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCGCTTGGCCAAGGGACCAGAGAAGGAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
T T GGCAGACAGGCAGAAGATGTCAT T C GACAC GGAGCAGATGCT GT CCGCGGCTCCT GGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
GAAACT CAGCAGGCAGC T T T GGAATGAATCC TCAGGGAGC T CCC TGGGGTCAAGGAGGCAATGGA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT T T T AAAT C CAAGC T G GG T T T CAT CAAC TGGGAT GC C ATAAAC AAG GAC CAGAGAAGC T C T C 
GCATCCCG TGAC CTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?^^ 
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FIGURE S2 



MKFQGPIACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGS.SSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 

i 

FNFDT FWKNFKSKLGFI NWDAINKDQRS SRI P 

Signal peptide: 

amino acids 1-21 



N-glycosylation site. 

amino acids 265-2 69 

Glycosaminoglycan attachment site. 



B- amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 



Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 



jj amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATATAACGATCT TCAACAAGAGT GCAAACAT CAT GCT T GACAAGTGGCAGCACC TGG 
CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 
CAGAAATGCATCTTCAGCTTTGACAGCCATTGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT 
C T T GGAGCT CAGT GCCC T TGTAGAGAAAAGAAGCCAGCATATCC TCCAGCACATGGACT T TCTG T 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGT CCAAGAC TT TGGATT T CAT T GATGTGCTTC TGC TGAGCAAGGAT GAAGATGGGAAGG 
CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 



MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARILAWT^ 

L I T PTEEGLKDS TQMS AT YS QG FTVWLGP IIP FI VLCHPDT I RS I TNASAAI APKDNLF IRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDT TAS GL S WVL YNLARHPE YQERCRQEVQELLKDRDPKE I E WDDLAQLPFLTMCVKE S LR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTWPDPEVYDPFRFDPENSKGRSP 
LAFI PFSAGPRNC I GQAFAMAEMKWLALMLLHFRFLPDHTE PRRKLEL IMRAEGGLWLRVE PLN 
VGLQ 




Important features: 



Transmembrane domains: 



amino acids 13-32 (type II), 77-102 



Cytochrome P450 cysteine heme-iron ligand signature. 



amino acids 461-471 




N-glycosylation sites. 



amino acids 112-116, 168-172 
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FIGURE 

AT CGCATCAATT GGGAGTACCAT CTT CC TCATGGGACCAGTGAAACAGC TGAA.GCGAATGT T TGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT TCATGGCCAGT T TTATGAAGC T T TGGAAGGCAC TATGGACAGAAGCTGGT GGACAGT T T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GT GAT T CGAACAT TT GAGGGT TACTT T TGGAAGCAACAATACAT T CT CGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T C TGGATGT TGT CCCAC TGAATT CCCAT GAATACAAACC TATTCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA&AAAAAAA^ 
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FIGURE *6 
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MGPVKQLKRMFE PTRL I AT IMVLLC FALTLCSAFWWHNKGLAL I FC I LQSLALT WYSLS F I PFAR 
DAVKKC FAVCLA 



Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein 

amino acids 30-72 
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FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAAC ATGC TCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 
GGAGATAAGGACCTTGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGAGG 
CT G GTGTT TAAGAT TTT GGACAAAAAGAATGAT GGAC GCATT GACGC GCAGGAGAT CATGCAGT CC CT GC GGGACTT G 
GGAGT CAAGATAT C TGAACAGCAGGCAGAAAAAATT CT CAAGAG CAT GGATAAAAAC GGCACGATGACCAT CGACT GG 
AAC GAGT GGAGAGACTACCAC CT CCTC CACCCCGTGGAAAACAT CCCCGAGATCAT C CT CTACT GGAAGCATT C CACG 
AT CT TT GATGT GGGTGAGAAT CTAAC GGT CCCGGAT GAGTT CACAGT GGAGGAGAG GCAGACGGGGATGT GGTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
GAG GTCCATGCCT CCC GCAGCAACAACAT GGGCATCGTT GGT GGCT TCACT CAGAT GATT C GAGAAGGAGGGGC CAGG 
TCACTCTGGCGGGGCAATGGCATCAACGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
ATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCGGAAGACAGGCCAGTACTCAGGA 
AT G CTGGACTGCGC CAGGAGGAT CCT GGCCAGAGAGGGGGT GGCC GCCT TCTACAAAGGCTAT GTC CCCAACAT GCT G 
GGCATCATCCCCTATGCCGGCATCGACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTG 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 

T T CAAACATAT CCT GC GGACC GAGGG GGCCT T C G GGCT GTACAGGGGGCTGG CC CCCAACTT CATGAAGGT CAT CCCA 
|«& GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 
f*% CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 

AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
Fiji GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTC^ 
f?f| CT C CGGGCT CACAT GTGTAAGGACAG GACAT TTT CT GCAGT GCCT GCCAATAGT GAGCTTG GAGCCT GGAGGCCGGCT 

fl TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
||J CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ijfl ATAAT CCAT GAT GAAAGGT GAGGTCACGT GGCCT CCCAGGCCTGACTT C CCAAC CTACAGCATTGACGCCAACT TGGC 

TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
pj CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
\\ GGCCTGGACCCTGT CAGGATGGGCCC CACCT CAGAACCAAACT CACT GT CCC CACT GT GGCAT GAGGGCAGT GGAGCA 

CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
3 ACT GTTGGGAAAAGGGT TTTGT C CAGAAGGACAAGCCGGACAAAT GAGC GACTT CT GT GCTT CCAGAGGAAGACGAGG 

f% GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
^ GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 
y 1 AACTATTT TTAT AGATT T GTTTAATTAATAGCTT GT CAT TTT CAAGTTCATTTT TTAT T CATATTTAT GTT CATGGT T 

f| GAT T GTACCTT CC CAAGC C CGCC CAGT GGGAT GGGAGGAGGAGGAGAAGGGGGGCCTT GGG C CGCT GCAGT CACAT CT 

GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
W GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 

AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CTT CAATAGGAT GCAAAGAT CAAT GCAAAAATT GTTATATAT GAACATATAACTGGAGT CGT CAAAAAG 
CAAATTAAGAAAGAATT GGACGTTAGAAGTT GTCATTTAAAGCAGCCTT CTAATAAAGTT GT TTCAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE 1 1 LYWKHS T I FDVGENLT VPDE FTVEERQTGMWWRHL VAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMG I VGG FTQMI REGGARS LWRGNGINVLKIAPESAI K 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSS I YPMEVLKTRMALRKTGQYSGMLDCARR 
I LAREGVAAFYKGYVPNMLG 1 1 P YAG I DLAVYE T LKNAWLQH YAYNSADPGVFVLLACGTMS SIC 
GQIASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVIPAV 

VYENLKI TLGVQSR 

Important features: 
Signal peptide: 
amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE sq 



G GAAGG C AGC G G C AGC T CC ACT CAGC CAGT AC C CAGATAC GC T GG GAAC C T T C CC CAGC C ATG G C 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CAC TCATCATT GGCT T T GGTAT TT CAGGGAGACAC TCCATCACAGT CAC TACTGTCGCCT CAGCT 
GGGAACAT T GGGGAGGATGGAATCCT GAGCT GCAC TT TTGAACCT GACATCAAACT TT C TGATAT 
CGT GATACAAT GGCT GAAGGAAGGTGT T T TAGGCT TGG TCCATGAGT TCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCAC T TC TAAAGGCAAGGGGAATGC TAACCT TGAGTATAAAACT GGAGCC T TCAGCAT GCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
: M TCAACAACACATACT CCTGTAT GAT T GAAAATGACAT TGCCAAAGCAACAGGGGAT AT CAAAGT G 

Jg ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
M CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAA ATAA T 
I|| GT GCCT TGGCCACAAAAAAGCAT GCAAAGTCAT T GT TACAACAGGGAT C TACAGAACTAT TT CAC 



© CACCAGATAT GACCTAG T T T TAT AT T T C TGGGAGGAAATGAATT CATAT CTAGAAGT CTGGAGT G 



AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 



*w C TATCT TCAAAGACATAT TAGAAG T TGGGAAAATAAT T CATG TGAACTAGACAAGT GT GT TAAGA 



in GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 

C| CACCTGGGGAGT GAGAGGACAGGATAGTGCATGT T CT T TGTC TC TGAAT TT T TAGT TATATGTGC 

Li 

W TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 



•4 



2? AAAT TAAGC T GT AGTATGTAC CCTAAGACGC TGC TAAT TGAC TGCCACT TCGCAACTCAGGGGCG 

GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T C T T C C C AAC T GAC AAAT GC CAAAG T T GAGAAAAAT GAT CAT AAT T T TAGCATAAAC AGAGC AGT 
CGGGGACACCGAT T T TATAAATAAAC TGAGCACC T TC T T T T TAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLY3WTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS S FFAI SWALLPLS P YLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 



N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 



Transmembrane domain: 



amino acids 258-281 




N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGAATCACCATGGCCAGCTATCCT TACCGGCAGGGCT GCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGT GGAC TCAGAT CACAGTGGCTATATC TCCAT GAAGGAGC TAAAGCAGGCCCTGG TCAAC TGCA 
ATTGGTCT T CAT T CAATGATGAGACC TGCC TCAT GAT GATAAACATGT T TGACAAGACCAAGT CA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
\* AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
W GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
Sf GGCCT T CCGGGAGAAGGACACAGC TGTACAAGGCAACAT CCGGCT CAGC T TCGAGGACT T CG T CA 

CCATGACAGC TT CT CGGATGCT ATGACCCAACCATCTGTGGAGAGT GGAGTGCACCAGGGAC C T T 



if ;| ? 

it -* 



TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGT GAGGACCGGGGCT GAGGCCACACAGATAGGGGCCT GAT GGAGGAGAGGATAGAAGT T GA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 

'■^ki- 
rn AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 

C| GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 



CT GACCAAGCATGAGAGAGAT CTGTCTATGGGACCAG TGGC T TGGAT TC T GCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T TGGGGCCAAAAGTCCAG TGAAAT TG TAAGC T T CAATAAAAGGAT GAAAC T CT GA 



APP_ID=10063587 



Page 211 of 318 



FIGURE 62 

MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPOTDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCLmiNMFDKTKSGRIDWGFSALWKFIQQWKNLFQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNI RLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site* 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-rayristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6a 

CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 

TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 

AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 

TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 

GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 

GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 

AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 

TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 

TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 

CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCGTGAATCCCAGTG 

TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 

ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 

CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 

GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 

CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 

TGTGGGCTCAATCCAGGCCCTGTTT^ACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 

TGCTCATCTCCAATAAAATAAAAGCACTTATGAAAAAAAAAAAAA^^ 

AAAAAAAAAAAAAAA?y\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 64 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGVVEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMtlAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6s 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACT ATG GGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGAC7\AATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACAC ATGA CAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CT GAGT GGCAATAAATAAAATT CGGTAT GC T G 



n V 
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FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 




N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATC TAG GTCCCCTCTCCTGCATCTGTCTCGCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTC TGCCCCT GAGGTCAAGAGAGGATGGGGCT AT T CAC T T T T AT ATAT T TATATAAAAT TAG 




TAGT GAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MANPGLGLL]^GLPFLLARWGI^WGQIQTTSMrENSTVLPSSTSSSSDGNLRPEAITAIIVVFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 



Important features 
Signal peptide: 
amino acids 1-19 



Transmembrane domain 

amino acids 56-80 



N-glycosylation site. 

amino acids 3 6-4 0 

cAMP- and c<34P-dependent protein kinase phosphorylation site 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GCCA6GAATAACTAGAGAGGAACAATGG GGTTAT T CAGAGGT T TT GTTTT C CT CTTAGT TCT GT GCCT GCT GCACCAG 
T CAAATACTT C CT T CATTAAGCT GAATAATAAT GGCTT T GAAGATATT GTCAT T GTT ATAGAT CCT AGT GT GC CAGAA 
GAT GAAAAAATAAT T GAACAAAT AGAGGATAT GGT GACTACAGCTT CTACGTAC CTGT TTGAAGCCACAGAAAAAAGA 
TTT TTTTT CAAAAAT GTAT CTATAT TAAT T CCT GAGAATT GGAAGGAAAAT C CT CAGTACAAAAGGCCAAAACAT GAA 
AACCATAAACATGCTGATGTTATAGTTGCACCACCTACACTCCCAGGTAGAGATGAACCATACACCAAGCAGTTCACA 

GAAT GT GGAGAGAAAGGC GAATACATTCACTT CACC CCT GACCTT CTACTT GGAAAAAAACAAAAT GAATAT GGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
T ACCGT GCTAAGT CAAAAAAAAT CGAAGCAACAAGGT GTTCC G CAGGTATCTCT GGTAGAAATAGAGT TTATAAGT GT 
CAAGGAGGCAGCT GTCTTAGTAGAGCAT GCAGAATT GATTCTACAACAAAACT GTAT GGAAAAGATTGT CAAT T CTT T 
C CT GATAAAGTACAAACAGAAAAAGCAT CCATAATGTTTAT GCAAAGT ATT GAT TCT GTTGT T GAATT TTGTAAC GAA 
AAAAC CCATAAT CAAGAAGCT CCAAGC CTACAAAACATAAAGT GCAAT TTTAGAAGTACAT GGGAG GT GATTAGCAAT 
T CT GAGGAT TTTAAAAACACCATACC CATG GT GACACCAC CT C CTCCAC CTGT CTT CT CATT GCTGAAGAT CAGT CAA 
AGAAT TGT GTG CTTAGTT CTT GATAAGT CTGGAAGCAT G GGGGGTAAGGACCGCCTAAATCGAATGAAT CAAGCAGCA 
AAACATTT C CT GCT GCAGACT GT T GAAAATGGAT CCTGGGT G GGGAT GGTT CACTTT GATAGTACT GCCACTAT T GT A 
AATAAGCTAAT CCAAATAAAAAGCAGT GATGAAAGAAACACACT CAT GGCAGGATTACCTACATAT CCT CT GGGAGGA 
ACTT CCAT CTGCT CTGGAATTAAATAT GCAT TT CAGGT GAT T GGAGAGCTACATT CCCAACT CGAT GGATC CGAAGTA 
CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 
TTT ATTGCTTT GGGAAGAGCT GCTGATGAAGCAGTAATAGAGAT GAGCAAGAT AACAGGAGGAAGT CAT T T TTATGTT 
T CAGATGAAGC T CAGAACAAT GGCCT CATTGATGCT TTT GGGGCTCTTACAT CAGGAAATACT GAT CT CTCCCAGAAG 
T CC CTTCAGCT CGAAAGTAAGGGATTAACACT GAATAGTAAT GCCT GGATGAAC GACACT GT CATAATT GATAGTACA 
GT GGGAAAGGACAC GTT CTTT CT CAT CACAT GGAACAGT CT GCCTCCCAGTATTTCT CT CT GGGATCCCAGT GGAACA 
ijl ATAATGGAAAATTTCACAGTGGATGCAACTTCCAAAATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACT 

T GGGCAT ACAAT CT TCAAGCCAAAGC GAACCCAGAAACATTAACTATTACAGTAACT T CT C GAGCAGCAAAT T CTT CT 
GT GCCT CCAAT CACAGT GAAT GCTAAAAT GAATAAGGACGTAAACAGTTTC C CCAGC CCAAT GATT GTTTACGCAGAA 
ATT CTACAAGGATAT GTAC CT GTT CTT GGAGCCAAT GT GACT GCTTTCATT GAATCACAGAATG GACATACAGAAGTT 
TT GGAACTTTT GGATAAT GGT GCAGGCGCTGAT T CTTT CAAGAATGAT GGAGT CTACT CCAGGTATTTTACAGCAT AT 
ACAGAAAAT GGCAGATATAGCTTAAAAGT T CG GGCT CATGGAGGAGCAAACACT GCCAGGCTAAAATTACGGCCT CCA 
P? CT GAATAGAGC C GCGTACATACCAGGCT GGGTAGTGAACGGGGAAATT GAAGCAAACCCGCCAAGACCT GAAAT T GAT 

GAGGATACT CAGACCACCTTGGAGGAT T T CAGCCGAACAGCAT CCGGAGGT GCATTT GT GGTAT CACAAGT C CCAAG C 
CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTT 
AC AT GGACAGCACCAGGAGATAATTTT GAT GT T GGAAAAGT T CAACGT TAT AT CATAAGAAT AAGT GCAAGTAT TCTT 
jg| GAT CTAAGAGACAGTT TTGAT GAT GCT CT T CAAGTAAATACTACTGAT CTGT CACCAAAGGAGGCCAACTCCAAGGAA 

AGCTTTG CATTTAAAC CAGAAAATAT CTCAGAAGAAAAT GCAACCCACATATTTATT GC CATTAAAAGTATAGATAAA 
Pjl AGCAATTT GACAT CAAAAGTAT CCAACATTGCACAAGTAACTTT GTT TATCCC T CAAGCAAATCCT GAT GACATT GAT 

g| CCTACAC CTACT CCTACT C CTACT CCTACT CCTGATAAAAGT CATAATT CT G GAGTTAAT ATTTCTACGCT GGT ATT G 

\r% T CT GTGAT T GGGT CTGTT GTAAT T GTTAACT T TATT TTAAGTACCACCATTTGAAC CTTAAC GAAGAAAAAAAT CTT C 

W AAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGTAAAGGATATTTCTGAATCTTAAAATTCATCCCAT 
01 GT GT GAT CATAAACTCATAAAAATAAT TT TAAGATGT CGGAAAAG GATACTTTGAT TAAATAAAAACACTCATGGATA 

;|> T GTAAAAACT GT CAAGAT TAAAATTTAATAGTTT CATTTAT TT GTTATTT TAT TT GTAAGAAATAGT GATGAACAAAG 

i\i ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 

GAAATTAAAAT CATCTAT CTGAGTAGT CAAAAT ACAAGTAAAGGAGAGCAAAT AAACAACATTT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGJCKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 

RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKAS IMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMVHFDSTATIWKLIQIKSSDERNTimGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCTDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKZ\NPET 
AANSSVPPI TVNAKMNKDVNS FPS PMI VYAE ILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADS FKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGE IEANPP 
1*1 RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 

FDVGKVQRYI IRI SAS I LDLRDS FDDALQVNTTDLSPKEANSKES FAFKPENI SEENATH I FIAI 



m 



m KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 



VNFILSTTI 



Signal peptide: 

•-» 

*t amino acids 1-21 



Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGGGCAGGGGTGA 
CAACAGGT GT CAT CT TT TT GAT CT CGT GT GT GGCTGC CT TCCTATTT CAAGGAAAGACGCCAAGGTAATTTT GACCCA 
GAGGAGCAAT GAT GTAGC CACCT CCTAAC CT T C CCTT CTTGAACCC CCAGT TAT GCCAGGAT TTACTAGAGAGT GT CA 
ACT CAACCAGCAAGCGGCT CCT T CGGCTTAACTT GT GGTTGGAGGAGAGAACCTTT GT GGGGCT GCGTTCT CTTAGCA 
GTGCTCAGAAGT GACTTGC CT GAGGGT G GACCAGAAGAAAG GAAAGGT C CCCTCTTGCT GTT GGCTGCACAT CAGGAA 
GGCT GT GAT GGGAATGAAGGT GAAAACTT GGAGATTT CACTT CAGT CATTGCTT CT GCCTGCAAGAT CATCCTT TAAA 
AGTAGAGAAGCT GCTCTGT GT G GT GGTTAACT CCAAGAGGCAGAACT CGTT CTAGAAGGAAAT GGAT GCAAGCAGCT C 
CGGGGGCCCC^AACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 

GTT TT GCTGGT GCT CCTCT GCT GT GCTAT CT CT GTC CT GTACAT GT T GGCCT GCAC CCCAAAAGGT GAC GAG GAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
C GCAACTACGT GAGCAGC CTGAAGCGGCAGAT C GCACAGCT CAAGGAGGAGCT GCAGGAGAGGAGT GAGCAGCT CAGG 
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CT GGCCTT CCT GCACT CGCAGGT GGACAAGGCAGAGGTGAATGCTGGCGT CAAGCT GGC CACAGAGTAT GCAGCAGT G 
CCT T TCGAT AGCTTTACT CTACAGAAG GT GTACCAGCTGGAGACTGGCCTTACCCGCCACCCCGAGGAGAAGCCT GT G 
AGGAAGGACAAGCGGGAT GAGT T GGT G GAAGCCATT GAATCAGCCTT GGAGACC CT GAACAATCCT GCAGAGAACAGC 
CCCAATCACCGT CCTTACACGGCCTCT GATT T CATAGAAGGGAT CTAC CGAACAGAAAGGGACAAAGGGACATT GTAT 
GAGCTCACCTTCAAAGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAA 
jM* GT GAAAAAT GAAAAGCT CAACATGGC CAACACGCTTAT CAAT GT TAT CGTGC CT CTAGCAAAAAGGGTGGACAAGTT C 

if*| CGGCAGTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG 
3J AAAGAAGAAATAAATGAAGTCAAAGGAATACTT GAAAACACTT CCAAAGCTGCCAACTT CAGGAACT TTAC CTT CAT C 

|Q CAGCTGAAT GGAGAATT TT CT CGGGGAAAGGGACTT GAT GT T GGAGC CCGCT T CTGGAAGGGAAGCAACGT CCTT CT C 

TTTTTCTGT GAT GT GGACATCTACTT CACAT CTGAATTCCTCAATAC GTGTAGG CT GAATACACAGC CAGGGAAGAAG 
GTATTT TAT CCAGTTCT TTTCAGT CAGTACAAT CCT GGCATAATATACGGCCAC CATGATGCAGT CC CT CCCTT GGAA 



:! 5 



CAGCAGCT GGT CATAAAGAAGGAAAC T GGATTTT GGAGAGACTTTGGATTT GGGAT GACGT GT CAGTAT C GGTCAGAC 
T T CATCAATATAGGTGGGTTT GAT CT GGACAT CAAAGGCT GGGGCG GAGAGGAT GT GCAC CTTTAT CGCAAGTAT CT C 
CACAGCAAC CT CATAGT GGTACGGAC GCCT GT GCGAGGACT C T TCCACCTCT GGCAT GAGAAGCGCT GCAT GGAC GAG 
CT GACCC C CGAGCAGTACAAGAT GTGCAT GCAGT CCAAGGC CAT GAACGAGGCATC CCACGGCCAGCT GGGCAT GCT G 
GTGTTCAGGCACGAGATAGAGGCTCACCTTCGCAAACAGAAACAG^ 

GAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTGGCTGCAACAGAGAAAAGACTTCCATAAA 
GGAC GACAAAAGAATT GGACT GAT GGGTCAGAGAT GAGAAAGCCT CC GATT T CT CT CT GT T G GGCTTTTTACAACAGA 
AAT CAAAAT CT CCG CTTT GCCT GCAAAAGTAACCCAGTT GCACCCT GTGAAGTGTCT GACAAAGGCAGAAT GCT T GT G 
AGAT TATAAGC CTAAT GGT GT GGAGGTTT T GAT GGT GTT TACAATACACTGAGACCTGTT GTTTT GTGT GCT CATT GA 
AATATTCAT GATT TAAGAGCAGTTTT GTAAAAAATT CATTAGCAT GAAAGGCAAGCATAT TT CT CCT CATAT GAAT GA 
GCC TATCAGCAGGGCT CTAGTT TCTAGGAAT GCTAAAATAT CAGAAGGCAGGAGAGGAGAT AGGCTT ATTAT GAT ACT 
AGT GAGTACAT TAAGT AAAATAAAAT G GACCAGAAAAGAAAAGAAAC CATAAATAT CGT GT CATAT TTT CC C CAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTT TTT CCCTT GTGAGTTATAGTC T GCTTATT TAAT TACCACTTT GCAAGCCT TACAAGAGAGCACAAGTT GGCCTAC 
ATT TTTATATTTT TTAAGAAGATACTTTGAGAT GCATTAT GAGAACTTT CAGTT CAAAGCAT CAAATTGAT GCCATAT 
M CCAAGGACATGCCAAATGCTGATTCTGTCAGGCACTGAATGTCAGGC^TTGAGACATAGGGAAGGAATGGTTTGTACT 

AAT ACAGAC GT ACAGATACTT T CT CTGAAGAGTATT TTC GAAGAGGAGCAACTGAACACT G GAG GAAAAGAAAAT GAC 
ACTTT CT GCTTT ACAGAAAAGGAAACT CATT CAGACT GGT GATATCGTGAT GTAC CTAAAAGTCAGAAACCACATTT T 
CT CCT CAGAAGT AGGGACC G CT TT CTTACCT GTTTAAATAAACCAAAGTATACC GT GTGAACCAAACAAT CT CTTT T C 
AAAACAGGGT GCTC CT CCT GGCTTCT GGCTT CCATAAGAAGAAAT GGAGAAAAATATATATATAT ATATATATATT GT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 

TAACTGAATTATTTTTTAAATTAAGCAGT T CTACT CAAT CAC CAAGAT GCTT CTGAAAAT T GCAT TTTATTACCAT T T 
CAAACTATT TTT TAAAAAT AAATACAGTTAACATAGAGT GGTTTCTT CATT CAT GT GAAAATTAT TAGCCAGCAC CAG 
AT GCATGAGCTAATTAT CT CTT TGAGT CCT T GCT T CT GT TT G CT CACAGTAAACT CATT GTTTAAAAGCTT CAAGAAC 
ATT CAAG C T GTT GGT GT GTTAAAAAAT GCATT GTATT GATT T GTACT GGTAGTTTATGAAATTTAATTAAAACACAGG 
CCAT GAATGGAAGGT GGT ATT G CACAGCTAATAAAATAT GATTT GT GGAT AT GAA 
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FIGURE 72 



MMMVRRGLLAW I SRVWLLVLLCCAI S VLYMLACTPKGDEEQLALPRANS PTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 

AEVNAGVKLATE YAAVP FDS FTLQKVYQLE TGLTRHPEEKPVRKDKRDELVEAI E SALE TLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAJ^FRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNL I WRT PVRGL FHLWHEKRCMDELT PE QYKMCMQSKAMNEAS HGQLGMLVFRHE I EAHL 
RKQKQKT S S KKT 

Important features: 




Signal peptide: 



amino acids 1-27 



N-glycosylation sites. 

amino acids 315-319, 324-328 




N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 



Amidation site. 



amino acids 377-381 
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FIGURE 73 

GAGAC T GCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT T TGTCCT GGGGATCCA 
GAAACCCAT GATACCCTACT GAACACCGAAT CCCC TGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 

GGAAACAAT GCCCAGTCGCCCAT CGATAT T CAGACAGACAGT GTGACAT TTGACCCT GAT T TGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCC.CTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 

T GAAGCCACATT TGCAGAGC TCCACAT TGTACAT T ATGACTCTGAT TCC TATGACAGCT TGAGT G 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCT TAT GAACACATTCTGAGTCACTTGCATGAAGTCAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACT^ACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATT TCAATGGAACAGC T GGAAAAGCTTCAGGGGACATT GT TC TCCACAGAAGAGGAGCC C T C TAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATAC 
CAT GGATGT GGAT GACT TCCCT T CAT GCC TAT CAGGAAGCCTCTAAAAT GGGGT GTAGGATCT GG 
CCAGAAACACTGTAGGAGTAGT AAGCAGATGT CCTCCT T CCCCTGGACAT CT C T TAGAGAGGAAT 
GGACCCAGGC T GT CAT T CCAGGAAGAACT GCAGAGCCT TCAGCCT CTCCAAACAT GTAGGAGGAA 
AT GAGGAAATCGCTGTG T TGT T AAT GCAGAGANCAAAC TC TGT TT AGT T GCAGGGGAAGT T TGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATATATTTGGAAATTAAAG 

TTTCTGACTTT 
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FIGURE 74 



MLFS ALLLEVI WI IAADGGQHWT YEGPHGQDHWPAS YPECGNNAQS P ID I QTDS VT FDPDLPALQ 



FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFAS FI QAGS S YTTGEMLSLGVGI LVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 



PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYYAAQLHLHWGQKGSPGGSEHQINSEAT 




N-glycosylation site. 

amino acids 213-216 



Transmembrane domain: 



amino acids 291-310 




Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7S 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 

AT GGAC T CCACCAGAGGGT C TT CAAGGAGT TAAAG T TACT T ACAC TGTGCAGTATT TCAT CACAA 
ATTGGCCCACCAGAGGTGGCAC TGAC TACAGAT GAGAAGT CCATT TC TGTT GTCCT GACAGCT CC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTT TCCATGCAACAAATATACT CCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 

H GAT TT TGAT TTAT GGAAAT GAAT T TGACAAAAGAT TC T TT GT GCC T GCTGAAAAAATC GT GAT TA 

ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 

jyj GGAAGAGGAGGT GAAACAT T TAGGGTATGCT T CGCATTT GAT GGAAAT T T T T TGTGACT CT GAAG 

HI AAAACACGGAAGG TAC TTCT CT CACCCAGCAAGAGTC CCTCAGCAGAACAATACCCCCGGATAAA 

W ACAGTCAT TGAATATGAATATGATGT CAGAACCACTGACAT T TGTGCGGGGCCT GAAGAGCAGGA 

GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG 
m TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
IJ! CAGGAGCACACAGACTC GGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCC T GGT CGACT GGGA 



rs 3 



M TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 

JJ? AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 

*' it 

£V CCAGACAGGCCACCAGGAGAAAAT GAAACC TATCT CAT GCAAT TCATGGAGGAAT GGGGGTT ATA 



i -ft* 



TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCC T T T GATCCCAGCCAT AAAGTACCT GGGATGAAAGAAGT T T T T TCCAGTT T GT CAGT GT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 

AGCAGT CAATAAAT GT T T GC CAGAC T GGGTGCAGAAT T TAT T CAGGTGGGTGT 



APP ID=1 0063587 



Page 225 of 3 18 



FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQ 

DQS S E FKAK 1 1 FW YVLP I S I T VFL FS VMG Y S I YRY I HVGKEKHPANL I L I YGNE FDKR FFVPAEK 
I VINFI TLNI SDDSKI SHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLME I FC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTGTGTGGTTTGCTGGCAGCCACCTTGATC 

CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 

ACAGGAGC TGAAGGACCACAACGCCAC CAGCATCCT GCAGCAGCTGCCGCTGC TCAGT GCCAT GC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGT GGAG T TCCACAT GACGAC TGAGGCCCAAGCCACCATCCGCATGGACACCAGT GCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCAtGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTGTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 

TGATCAATGAAAAGGCT GCAGATAAGC T GGGAT CTACCCAGATCGT GAAGAT CCTAACT CAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 

TT TACACCAAAGGTGACCAACTTATACTCAACT TGAATAACATCAGCTC TGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 

CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVLGSLWTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 

VKNQLC PVI EAS FNGMYADLLQLVKVP ISLSIDRLE FDLLYPAI KGDT I QL YLGAKLLDS QGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNNI S SDRI QLMNSG I GWFQPDVLKNI I TE 1 1 HS I LLPNQNGKLRSGVPVSLVKALG 
FEAAE S S L TKDAL VLT PASL WKP S S P VS Q 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins . 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCC TCT GGATGGAATGTGCCACACACAGCACAGGCATCACCCAGT GTGACAT CTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 

CATGAAAT T T GAGAT T GGAGAGGC TC TT TACT T GGGCATTAT TTCTTCCCTGT T CT CCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 

GTCAGAAGGTGC T GCT GAGGATAGACT GAC T TT GGCCAT TGGAT TGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAATTTGTGCATGAACTGAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATG 

CAGGATGGGAGGACAGGAAGGCAGCC TGGGACAT T TAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 



I YS TLLGLPADI QAAQAMMVT S S AI S SLAC 1 1 S WGMRCT VFCQESRAKDRVAVAGGVFFI LGGL 
LGFI PVAWNLHGI LRDFYSPLVPDSMKFE IGEALYLG IIS SLFSL IAGI ILCFSCS SQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 




N-glycosylation site. 

amino acids 190-193 



** 1 ^. 



PMP-22 / EMP / MP20 family proteins. 



amino acids 4 6-59 
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FIGURE 8l 

CCCACGCGTCCGCGCCTCTCCCT'TCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCT.TTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 

CAGAC CCAAGGGAGCAGGCAGCACAACTGT CAAGATCGTCC T GAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 

Si 

CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAGTACC TGGCCCAAGGCCACACAGCCAGAAT CT T CCACT TGACTCAGAT CAAGAAAGT CAG 
GAAGCAAGAC TT CCAGAAAGAGGCACAGCAC T TC CGACT GC TCGC TGGCCCCCACGAAGGTCACT 
GGAACGTCT T CCTAGCCCAGACCC TGGAGC TGAAGGT CACGGCCAGTCCAGACAAAGTGACCAAG 
AC AT AAC AAAGAC C TAAC AGT T GC AGAT AT GAG C T G T AT AAT T G T T G T TAT TAT AT AT T AAT AAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 



1 ^ 



= ?: 



FIGURE 82 

MVPETOVLSSLLGLMiLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I S S TRCPKAPGRVLVHTSVS PS PDNLRRFALEHE AS DLVE I YLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 



GACAGCTGT GTCT C GAT GGAGT AG ACTC TCAGAACAGC GCAGTT T GCC CT CC GCTC ACGC AGAGCCTCT CC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCT CAAGCT GGGATC AGGGCAGT GGC AGGTGTTTGGGCC AGAC AAGC CT GTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGTT CTCT AGCGTGGT CC AC CT CT AC AGGG ACGGGAAGGACCAGCC AT TT AT GC AG ATGCCACAGT ATC 
AAGGCAGG AC AAAACT GGTGAAGGATT CTAT TGC GGAGGGGC GCAT CT CTCT GAGGCTGGAAAAC ATT ACT 
GTGT TGGATGCT GGCC TCTATGGGTGCAGG ATT AGT T CCCAGT CT TACT ACCAGAAGGC CAT CT GGGAGCT 
AC AGGT GTCAGC ACT GGGCTCAGT TCCT CT CATT TCCATC AC GGGAT AT GT TGAT AGAGAC ATC CAGCT AC 
TCTGTC AGT C CT CGGGCT GGT T CCCCCGGCCC ACAGC GAAGT GGAAAGGT CC ACAAGGAC AGGAT T TGT CC 
ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
CGCC GGGAGC AT ATCCTGTTCC AT GCGG CAT GCT CAT CTGAGCCGAGAGGTGGAATCCAGGGT ACAGATAG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA 
CT GGGAGGTGGACGGAGGAC AC AAT AAAAGGT GGCGCGTGGG AGTGTGCCGGGATG ATGT GGACAGGAGGA 
AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGC T T ATTGAGGCCCT AC AT TGAGT AT C C G TCCT AT AATGAGC AAAATGG AACTCCCAT AGT CATCT GC 
CC AGTC ACCC AGGAAT CAGAGAAAGAGGCCT C TT GGC AAAGGGCCT CT GCAATC CC AGAGAC AAGC AACAG 
•PGAGTCCT CCTC AC AG GCAACC AC GCCCTT CGTC CCCAGGGGTGAAAT GTAGGATGAATC ACAT CCCAC AT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 

GAT GAAGGGGGACTGGCC TGTCCACAT GGG AGTC AGGT GT CATGGCTGCCCT GAGCTGGGAGGGAAGAAGG 
C T GACATT AC AT TT AGT T TG CT CT CACT CC ATCT GGCT AAGT GAT CTT GAAATACC ACCT CTCAGGTGAAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 

ATCTTATTGATGACAGAGTGT ATCCTAATGGT TTGTTCAT TATATT ACACTTTCAGTAAAAAAA 
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FIGURE 84 



MALMLSLVLS LLKLGS GQWQVFGPDKP VQALVGEDAAFS C FLS PKTNAEAME VRFFRGQ FS SWH 
L YRDGKDQP FMQMPQYQGRTKLVKDS IAEGRIS LRLENI T VLDAGL YGCRI S S QS YYQKAIWE LQ 
VSALGS VPL I S I TGYVDRDI QLLCQS SGWFPRPTAKWKGPQGQDLS TDSRTNRDMHGL FDVE I SL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKA.PQEVPHSEKRF 
TRKS WASQS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKE YVTLS PDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8* 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATCCTGCTGCTGCTGCTGCCCCT 

GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 

CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 

TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 

AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 

GGGACCCACATACCAAGAAT T GCACCC TGAGCATCAGAGATGCCAGAAGAAGT GAT GCGGGGAGA 

TACT T CT T TCGTAT GGAGAAAGGAAGTATAAAAT GGAATTATAAACATCACCGGCT CTCT GT GAA 

TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 

AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 

GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCAGCCGCTCCTCGGTGCTCACCCTCATCCCACA 

GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 

ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 

GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 

GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCGCTGCCAGGCTGAGCCTGAGCTGGA 

GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 

CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 

GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 

CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 

TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 

CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 

AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 

ATGAGAAAC TGCAGAGACT CACCC TGATTGAGGGATCACAGCCCC T CCAGGCAAGGGAGAAGTCA 

GAGGCTGATTCTTGTAGAAT TAACAGCCCTCAACGTGATGAGCTATGATAACACTATGAAT TATG 

TGCAGAGTGAAAAGCACACAGGCTTTAGAGTC7VAAGTATCTCAAACCTGAATCCACACTGTGCCC 

TCCCTTTTATTTTTTTAACTAAAAGACAGACAAATTCCTA 
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FIGURE86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSV3WTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLS FCVI FVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 

YSEIKIHR 



Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 

CCAAGGAAAGTGCAGCTGAGAC TCAGACAAGATTACAATGAACCAAC TCAGCTTCCTGCT GT TTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACCTT TGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCT GGCTACT 
ACGACATCCAGGCCAAGGACCTGGGCATGTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 

GT TTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGT TGGACTGACAACGGCC 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCGAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGCGGGAATTCACTGCGGGATTTGTTCAGT TCAGGGTATT TAATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
T TGAGAGTT T TGT GGGAGGGAACCCAGACC TCT CCTCCCAACCAT GAGATCCCAAGGAT GGAGAA 
CAACT TACC CAGTAGC T AGAATGT TAATGGCAGAAGAGAAAACAATAAATCATAT T GACT CAAGA 
AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 

SDDYKNPGYYDI QAKDLGI WHVPNKS PMQHWRNS S LLRYRTDTGFLQTLGHNL FGI YQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 

TEHHC I GGGGY FPEAS PQQCGDFSGFDWS GYGTHVGY S S SRE I TEAAVLL FYR 

Important features: 
Signal peptide: 

amino acids 1-16 



N-glycosylation site. 

amino acids 163-167 




Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 



N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 

TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 

T TAAGACT TATATACAGT T T TAGGGGACAAT T AAAAAAAAAAAA 
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FIGURE QO 

MGRVS GLVPSRFLT LLAHLVWT TL FWSRDSN I QACLPLT FTPEE YDKQDI QLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 

TVFGLKKKPF 



Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Ql 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGA^CAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCGCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 

GAGT GGGAGAAGGTAGAACGGGGCCGCCGGACCT CGC T TT GCACCCACGACCCAGCCAAGAT CTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGT GGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCT CAAGT GGCAG 
AGAAAGGGTCCCAAGT GCT GG TCCCAACCTGAAGCTGT GGAGTGAC TAGATCACAGGAGCACT GG 
AGGAGGAGTGGGCTC T C TGTGCAGCC TCACAGGGC TT T GCCACGGAGCCACAGAGAGAT GCT GGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 

GCACACCCATCCT TAAGCTAAGACAGGACGAT TGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 

TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE 02 



MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 

SVHFQHNATGQGNI S I SLVPPSKAVEFHQEQQI FIEAKASKI FNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 



S 
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FIGURE Q3 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 

ATGT T CCGAT T T GCAT AT TAT AAACT C T TAAAAAAAGCCAGTGAAGGT TT GAAGAGT AT AAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCACTTCCCAAACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCT 
TTTTCTGAAAATCCCTTTTTCTGGTGGAATTGAGAAAGAAATAAAACTATGCAGATA 
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FIGURE Qd 

MTAAVFFGCAFIAFGPAL^YVFTIAIEPLRIIFLIAGAFFWLVSLLISSLWFMARVIIDNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFIiAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169/ 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 



AAT TT TT CACCAGAGTAAACT TGAGAAACCAAC T GGACCT T GAG TAT T GTACAT T T TGCCTCGTG 
GACCCAAAGGTAGCAATC T GAAACATGAG GAG TACGAT TC TACT GT T T TGT CT T CT AGGAT CAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAA.CTGGCTCCGGATCAGG 
GAACAC TACCAAACCAACAGCAGTCAAA.TCAGGTCT TTCCTTCTT TAAGTC TGATACCATTAACA 
CAGATGC T CACACTGGGGCCAGATC TGCATCTGT TAAAT CCTGC TGCAGGAATGACACCT GGTAC 
CCAGACCCACCCAT T GACCCTGGGAGGGT T GAATGT ACAACAGCAAC T GCACCCACAT GT GT TAC 
CAATT TT TGTCACACAACTTGGAGCCCAGGGCACTAT CCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT TGAT TAT AT T AT GG AAT AGAT T G AG AC ACAT T GG AT AG T C T T AGAAG AAAT T AAT T C T T AAT T 
TACC T GAAAAT AT TCT TGAAAT T TCAGAAAATAT GTT CT AT GTAGAGAATCCCAACT T T TAAAAA 
CAATAATTCAATGGATT^ATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT 

T AAAAC A T A T T T G G AAAAC T G G AAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Q6 



MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
L PQ I FTS L 1 1 HS LFPGG I LPT S QAGANP DVQDG S LPAGGAGVNPATQGTPAGRL PT PS G 
T DDD FAVT TPAG I QRS THAI EEAT TE SANG I Q 

Signal peptide: 

amino acids 1-16 



APP_ID=1 0063587 



FIGURE Q7 

GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCT ACAGGTCACCC T GGAGAT GCAGGAT GGACATGTC T TGT GGGGT CCAC 
AGC C T GT GC T T GTGCACGT GAAGGAT GAGAAT GACCAGGT GCCCCAT T T CT C TCAAGCCAT CT AC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 

GGAT GAGCCAGGCACAGCCAAC TCGGATC T TCGATT CCACATCC TGAGCCAGGCT CCAGCGCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
T GCAGAGGGAAACCTCTACGT GACCAGAGAGCT GGACAGAGAAGCCCAGGCT GAG TACCT GCT CC 
AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 

CAGTCCACCAGGT ACTGAAGTGAC TAGAC T GT CAGCAGAGGATGCAGAT GCCCCCGGC T CCCCCA 
ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
CC TCGAGCCCGCC TT CCGCC TCATGGAT T T TGCCAT TGAGAGGGGAGACACAGAAGGGACT T T TG 
GCCTGGAT TGGGAGCCAGACTCTGGGCATGT T AGACT CAGAC TC TGCAAGAACCT CAGT TATGAG 
GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACC T C TGCACAC CCCGCCAAGACCATGGC T T GATCGT GAGTGGACCCAGCAAGGACCCCGAT CT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 

AT T T T CAC C CAC TGGAC CAT GTCAAGGAAGAAGGACCC GGAT CAACCAGCAGACAGCGT GCCC C T 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCC TGGGAGAGAGCCCAGCACCCAAGAT CCAGCAGGGGACAGGACAGAGTAGAAGCCCC T CCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAASiAAAAAAAAAAAG 



FIGURE Q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 

EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 

YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 

STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVSIIESTWVSLEPIHLAENLKVLYPHHMAQ 

VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 

VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 

FQVDPTSGSVTLGVLPLRAGQNILLLVIAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 

PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 

EAAPSHEVVVWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQES YEASVPI SAPAGS FLLT 

IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 

QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 

REHI I PVWSHNAQMWQLLVRVI VCRCNVEGQCMRKVGRMKGMPTKLSAVGI LVGTLVAI GI FLI 

LIFTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 

GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGAT T CTT CCCAGGAACACAAACGTAGGAGACCCACGCT CC T GGAAGCACCAGCCTT TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
AT C T AGGACC C AGGCAT CTTGCTTTC CAGC C ACAAAGAGACAGATGAAGAT GC AGAAAGG AAATG 
T TC T CCT TATG T T T GGTCT ACTAT TGCAT T TAGAAGC T GCAACAAAT TCCAATGAGAC TAGCACC 
T CTGCCAACAC T GGAT CCAG T GT GAT C TCCAGT GGAGCCAGCACAGCCACCAAC T C T GGGT CCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 

GCATAGT CACCAACT CTGAGTTCCATACAACCT CCAGT GGGAT CAGCACAGCCACGAACT CT GAG 
T TCAGCAGAGC GT CCAGTGGGATCAGCATAGCCACCAACT CTGAGTCCAGCACAACCT CCAGT GG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 

AGTAGGGCCAGCACTGCCAC CAACT C T GAGTCTAGCACACT CT CCAGT GGGGCCAGCACAGCCAC 
CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 

GCCACCAAC TCT GAGT CCAGCACAACC TCCAGT GGGGCCAGCACAGCCACCAACT CTGAGT CCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
O GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 

fk T CCAGCACGACCT CCAGT GGGGCCAGCACAGCCACCAACT C T GAG TCCAGCACGACCT CCAGT GG 

S GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
iy * CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 

AGTGGGGCCAACACAGCCACCAAC TC T GAG TCCAG TACGACCTCCAGT GGGGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAAC TCT GAC T CCAGCACAAC C TCCAGT GAGGCCAGCACAGCCACCAACT CT GAGTCTAG 
CACAGTGTCCAGT GGGATCAGCACAG.T CACCAAT T CT GAGT CCAGCACAACC TCCAGT GGGGCCA 
ACACAGCCACCAACT C T GGGTCCAGT GTGACCT C T GCAGGCT CTGGAACAGCAGCT CT GACT GGA 



it 



0 ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 



II GCCGTGGGAAATCTTCCTCATCACCCTGGT.CTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 

m TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 

ipt CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 

W GTGGAGTCC TAAC TGGT T CT GGAGGAGACCAGTAT CAT CGATAGCCAT GGAGATGAGCGGGAGGA 

CI ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 

«* CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 



AATCTTGAAGAAGGTATTCCTCACCTTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATAT 

T GC TCAT T TAGCTAAGAAATAAATACAT C TCAT C TAACACACACGACAAAGAGAAGCT GT GC T T G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 

CATCTGGCAT TCAAAAT CTCCACAGTAAAATCCAAAGACCTCAAAAA^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE lOO 

MKMQKGNVLLMFGLLLHLEAA.TNSNETSTSANTGSSVI SSGAS TATNSGS SVTS SGVSTAT I S GS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 

AMEMSGRNSGP 



Signal peptide: 

amino acids 1-20 



Transmembrane domain 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGT TTCTAGCTACTGCACGTACTTCATAAAGCAGGACTC TAAAAGCT TTGGAATCAT 

GGTGTCAT GGAAAGGGAT T TACT T TAT ACT GAC T CT GT T TT GGGGAAGC t T T TT T GGAAGCAT TT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GATTATAACTGGGGATGCATTTGTTCCTGGAGAAAGAAGTGTCATTATCATGAACCATCGGACAA 

GAATGGAC T GGATGT TCCT GTGGAATTGCCTGAT GCGATAT AGCTACC T CAGATT GGAGAAAAT T 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 

C T TC AT TCAT AGGAAAT GGAAGGAT GACAAGAGC CAT T T CGAAGACAT GAT T GAT T AC T T T T GT G 
ATATTCACGAACCACTTCAACTCCTCATATTCCCAGAAGGGACTGATCTCACAGAAAACAGCAAG 
TCTCGAAGTAATGCATTTGCTGAAAAAAATGGACTTCAGAAATATGAATATGTTTTACATCCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 

CC T TCAAC T C TGGTGCCACAAACGGT GGGAAGAGAAAGAAGAGAGGCTGCGT TCC T T CT AT CAAG 
GGGAGAAGAATTT TTATTT TACCGGACAGAGTGTCATT CCACCTTGCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTAT TT TATAATCACCAT TGTAATCT TTGTGCTGCAAGAGA 
GAATAT T TGGTGGACT GGAGAT CATAGAAC T TGCATGT TACCGAC T T T TACACAAACAGCCACAT 
T TAAAT TCAAAGAAAAAT GAGTAAGAT T ATAAGGT T T GCCAT GT GAAAACCTAGAGCATATTT T G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 

GCCATCATTATTTGTTAAAGATATTTTGCACTTAATTTTGTGGGAAAAATATTGCTACAATTTTT 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 

GGGCCAGAATATTATTAAACAATCATCAGGCTT TTAAA 
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FIGURE 1Q2 

MHSRGREIVVLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 

FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCmRYSYLRLEKICLKASLKGVPGFGWi^QMAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLVVKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 

LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-10.8, 354-377 
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FIGURE 1Q3 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGGTCATTTTGGTTCTGAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 

CACC T CTACAGAGAT GGGGAAGACTGGGAATCT AAGCAGAT GCCACAG TATCGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCGACAAGGAC 
AGGAT TTGTCTT CAGACT CCAGAGCAAAT GCAGATGGGT ACAGCC TGTATGATGT GGAGATCT CC 
AT TATAGTCCAGGAAAAT GCT GGGAGCATAT T GT GT T CCAT CCACCT TGCT GAGCAGAGTCAT GA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
M< AAATCCAAAGGGAAAATCCAGGCGGAACT GGAC TGGAGAAGAAAGCACGGACAGGCAGAAT T GAG 

O AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
f*l TT TC TGAT CTGAAAACT GTAACCCATAGAAAAGC TC CCCAGGAGGTGCC TCAC T C TGAGAAGAGA 

T T TACAAGGAAGAGT GTGGTGGC TTC T CAGGGT T TCCAAGCAGGGAGACAT TACT GGGAGGTGGA 
f I CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 

W AT GT GAC T T TGT C T CCCAACAATGGGTAT T GGGT CCT CAGACTGACAACAGAACAT TT GTAT T TC 

$1! ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
f» GGAC TAT GAG GG T G G GAC CAT CTCCTTCTT C AAT AC AAAT G ACC AG T C C C T TAT T TAT AC C C T GC 

M TGACATGTCAGT T T GAAGGCT TGT TGAGACCCT AT AT CCAGCAT GCGATGTATGACGAGGAAAAG 

1 * GGGACTCC CATAT TCATAT GTCCAG TGTC C TGGGGATGAGACAGAGAAGACCC T GCTT AAAGGGC 

CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
« TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
?t CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 

W ACAGT GATT CCT GCC T CACAGGT GAAGAT TAAAGAGACAACGAATGTGAAT CATGCTT GCAGGT T 

© TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAA.CTCTGTT 

ft| TGCT TATTCCACATTAATT TAC TT TTCTCTATACCAAATCACCCATGGAATAGTTATTGAACACC 

T GCT T T GTGAGGC TCAAAGAATAAAGAGGAGGTAGGAT TT T T CACT GAT TC TAT AAGCCCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 

C T CAT TAACACAGACACAAAAAT T CTAAATAAAAT T T T AACAAAT TAAACTAAACAATATATT TA 
AAGATGATATATAACTACTCAGTGTGGTTTGTCCCACAAATGCAGAGTTGGTTTAATATTTAAAT 

AT CAACCAGT GT AATT CAGCACAT TAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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FIGURE 1Q4. 

MAEVLILVLSFYELVSGQWQVTGPGK 

LYRDGEDWESKQMPQYRGRTEFVKDS IAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 

I VQENAGS I LCS I HLAEQSHEVE SKVLIGE T FFQPS PWRLAS I LLGLLCGALCGWMGMI I VFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRPCAPQEVPHSEKRF 

TRKS WAS QGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLS PNNGYWVLRLTTEHL YFT 
FNPHFI SLPPSTPPTRVGVFLDYEGGT I S FFNTNDQSL I YTLLTCQFEGLLRPYI QHAMYDEEKG 

TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE 10* 



CCTT C AC AGG ACTCT T CATT GC TGGTTG GCAATGAT GT ATCGGCCAGATGT GGT GAGGGCT AGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
T G AC AAACT AT ATGCT GAGTTT GGCAGAGAGGCT TCT AACAATTT T AC AGAAAT G AGCCAGAGACT T G AAT 
CAAT GGTGAAAAATGC AT TT T AT AAAT CTCCAT T AAGGGAAGAAT T TGTC AAGT CT CAGGTT AT CAAGT TC 
AGT C AACAGAAGC AT G GAGTGT TGGCTC AT AT GCTGTTGATT TGT AGATTTCACTCT ACT GAGG ATCCTGA 
AACT GT AGAT AAAATT GT TCAACT TGTT TT AC ATGAAAAGCT GCAAGATG CT GT AGGACCCCCT AAAGT AG 
ATCC TCACTCAGTT AAAATT AAAAAAAT CAAC AAGACAGAAAC AGAC AGCT ATCTAAACC AT TGCT GCGGA 
AC ACGAAGAAGT AAAACT CT AGGT C AGAGT CTCAGG ATCGTT GGTGGGAC AGAAGT AGAAGAGGGT GAATG 
GCCC TGGCAGGCT AGCCT GC AGTGGGAT GGGAGT CATCGCTGTGGAGC AACCTT AAT T AATGCCAC ATGGC 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA 

TGACT AT G AT AT TTCT CT T GC AGAGC T T TCT AGCCCTGTT CC CTAC AC AAAT GCAGT ACAT AGAGTTTGT C 
TCCCTGAT GC AT CCT AT GAGTTTC AACC AGGTGATGT GAT GT TTGTGAC AGG AT TTGGAGC ACT GAAAAAT 
GAT GGT TACAGT CAAAAT C ATCTT CGAC AAGC ACAGGTGACT CTC ATAGACGCT AC AACT TGC AAT GAACC 
TCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTTAGAAGGAAAAACAGATGCAT 
GCC AGGGT GACTCTGG AGGACC ACTGGT T AGT TC AG AT GCTAGAGATATCTGGT ACCT TGCTGGAAT AGT G 
AGCTGGGGAGATGAATGTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTG 

G ATT ACTT CAAAAACT GGT AT C TAAGAG AC AAAAGCC T CATGGAAC AGATAACATTTT TT TTTGTTTTTTG 
GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 

GATC AACT CT GT CATC T GTG AGCAAT AGTTGAAACT TT AT GT AC AT AGAGAAAT AGAT AATACAAT ATT AC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 

AAGT ATT AGGTGT T TT TCTTAGTGGAAT ATT AGAAAT GAT CAT ATT CATT AT GAAAGGTCAAGC AAAG AC A 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 

ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
AT AAT AAT TAT ACAAACTT CAT GCAATG TACT TGT TCT AAGCAAAT TAAAGC AAAT AT TT AT TT AACATT G 
T T AC TGAGGAT G T C AACAT AT AAC AAT AAAAT AT AAAT CACCCA 
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FIGURE 106 

MMYRPDWRARKRVCWE PWVI GLVI FI SL IVLAVCIGLTVHYVRYNQKKTYNYYSTLS FTTDKLY 
AE FGREASNNFTEMS QRLE SMVKNAF YKS PLREE FVKS QVI KFS QQKHGVLAHMLLI CRFHS TED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT T CNE PQAYNDAI TPRMLCAGS LE GKT DACQGDS GGPLVS S DARD I WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDW I TSKTG I 



Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 1Q7 

AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGM.GAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATTTTGACTTACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GAT AT GGTCACTT TGAAAGCCAGGATGACAAAT TATGGAT TACCTAGAT AT CGGT GGCT TACT CA 
T GCT T GGAAT T T T TT T CAGAGAGAGT TTAAGTGCT GT GGAGTAGTATAT TTCACT GAC T GGT T GG 
AAATGACAGAGAT GGACT GGCCCCCAGAT TCCT GCT GT GT T AGAGAAT TCCCAGGATGT T CCAAA 
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 

TT TGAGAGGAACCAAACAACTGCAGGTGC TGAGGTT TCTGGGAATCTCCAT TGGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAAT GAT GTCCT T GAAGAATGACAAC TCT CAGCACC TGTCAT GT CCCT CAGTAGAACT 
f* GTTGAAACCAAGCCTGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACT 
% T TGAGATGGAGGAGTTATAAAAAGAAATGTCACAGAAGAAAACCACAAACTTGTT TTATTGGACT 

tjj T GTGAAT T TT T GAGTACATACTAT GTGT T T CAGAAAT AT GT AGAAATAAAAATGT T GCCAT AAAA 

TAACACCTAAGCATAT ACT AT TCTAT GC T T TAAAAT GAGGAT GGAAAAGT T TCAT GTCATAAGTC 
ACCACC TGGACAAT AAT TGATGCCCT TAAAAT GC TGAAGACAGATGTCATACCCACT GT GTAGCC 
ft TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 

TCCATGCAAACGAGT CACAT AT GGTGGGAC TGGAGCCATAGT AAAGGT T GAT T T ACT T CT ACCAA 
C TAGTATATAAAGTACTAAT T AAAT GC T AAC AT AGGAAG T T AGAAAATAC T AAT AAC T T T TAT TA 
^ C TCAGCGATCTATTCT TCTGATGCTAAATAAATTATATATCAGAAAACT TTCAATAT TGGTGACT 

ACCTAAATGTGAT T T T TGC T GGT TAC TAAAAT AT TCT TACCACT TAAAAGAGCAAGC TAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 

TTCAGTTCTGATAATGT TAAGAATAACCATTATGAAAAGGAAAAT TTGTCCTGTATAGCATCAT T 
m AT TTT TAGCCT T T CC TGTTAATAAAGC T T TAC TAT TCTGTCCTGGGCT TAT AT TACACAT AT AAC 

TGT TATT TAAA.TACT TAACCACTAAT T T TGAAAAT TACCAGT GTGATACATAGGAATCAT TAT T C 
GAAT GT AGTCTGGT C T T TAGGAAGTAT TAATAAGAAAA.TTT GCACATAAC T TAGT T GAT T CAGA 
AAGGAC TT GTAT GCT GT T T T T CTCCCAAAT GAAGACT CT TT T TGACACTAAACAC T T TT TAAAAA 
GCT TATCT T TGCCT T CTCCAAACAAGAAGCAATAGTCTCCAAGTCAATAT AAAT T CT ACAGAAAA 

TAGTGT TC T T TT T C TCCAGAAAAATGC T TGTGAGAAT CATTAAAACATGTGACAAT T TAGAGAT T 
CTTTGTTT TAT T T C AC T GAT T AAT AT AC TGT GGCAAAT T AC AC AGAT TAT TAAAT T T T T T T AC AA 
GAGTATAGTAT AT T T AT TT GAAAT GGGAAAAGTGCATT T TAC TGTATT T T GT GTAT T T T GT T TAT 
T T C TCAGAATAT GGAAAGAAAAT TAAAATGTGTCAATAAATAT TTT CTAGAGAGT AA 
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FIGURE 108 

MAREDSWCLRCLLYALNLLFWLM^ 

VMI AVCC FL 1 1 VGMLGYCGTVKRNLLLLAWY FGSLLVI FCVELACGVWT YEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQRE FKCCGVVY FTDWLEMTEMDWPPDS CCVRE FPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 

MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE 1QQ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGTCTT TCTCGT GAGAGCCTAGAGGCCTT AAAAAAAAAAGT GCTT GAAAGAGAAGGGGACAAAGGAACA 
CCAGT AT TAAGAGGATTTT CCAGT GTT TCTGGCAGTTGGTCCAGAAGGATGCCTCC ATTCCT GCTTCTCACCT G 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGTGAATGGGGAG 

T GGTAC C ACTT CAC GGGCAT GGC GGGAGAT GCC AT GCCT ACCT T CT GCAT AC CAGAAAAC C ACT GT GGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 

TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 

C T GAC CAAGCC CAGC GT CT GCTT CC AC GTCT ACT GT GGT CATTTTT AT GAC ATCT GCGAC GAGG ACT GCC AT GG 
CAGCTGCTCAGATACCAGCGAGTGCACATGCGCTCCAGGAACTGTGCTAGGCCCTGACAGGCAGACATGCTTTG 
ATGAAAATGAATGTGAGCAAAACAACGGTGGCTGCAGTGAGATCTGTGTGAACCTCAAAAACTCCTACCGCTGT 
GAGT GTGGGGT T GGCCGTGTGCT AAGAAGTGATGGCAAGACTT GTGAAGACGTTGAAGGATGCCACAATAAC AA 
TGGT GGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGT GTGAATGTCCCCGGGGCCTGGTGCT GT 
CTGAGGAT AACCACACTT GCCAAGTCCCTGT GTTGT GCAAATC AAAT GCC ATTGAAGTGAACATCCCCAGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 

CC G GGAT GGCT GT GT TT C AGAT GACT C GGT AAAGC AGT AC AC AT C CC GGGAT CACC T AGC AAAGC ACTT CCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 

TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 

CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 

GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 

TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 

CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 

CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 

C AC AAAAT C AGAAGC T G GGT AT AAT AT T T C AAGT T AC AAACCC T AGAAAAAT T AAAC AGT T AC T GAAATTAT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 

TT AT AGGGAAT TT GGAAGT GTATCAAT AAAACAGT AT AT AATT TT 



FIGURE HQ 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 

TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACAS FNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEWIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 

CE FPRL YT I S EG Y VPNLRNS PL E IMS RNHG I FP FTLE I FKDNE FEE P Y REAL PTLKLRDSLY FG IE PWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 

LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPI RI DWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 



GAGAGAGGC AGCAGCT T GCTCAGCGGAC AAGGAT GCT GGGCGTGAGGGACCAAGGC CTGCCC TGCACT CGG 
GCCT CCTCCAGCCAGTGCT GAC CAGGGACT TCT GACCT GCT GGCCAGC CAGGACCT GTGT GGGG AGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 

TACAGGAT CCTG AC AGTGATCAAC CTCT GAAC AGCC TCGATGT C AAACCCCTGCGCAAACCC CGT ATCC CC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 

GGT T GT CCTCAT C AAGGT GATT CT GGAT AAAT ACT ACTTCCT CT GC GGGC AGCCT C TC CACT TCATCCCGA 
GGAAGC AGCT GT GTGACGGAGAGCTGGACT GTCCCTTGGG GG AGGACGAGGAGC ACTGTGT C AAGAGCT TC 
CCCGAAGGGCCT GCAGTGGC AGTC CGCCTCT CC AAGGACCGATCC AC ACT GCAGGT GCTGGACT CGGCCAC 
AGGGAACT GGTT CTCT GC CTGT TTCGAC AACTT CACAGAAGC TCTC GC T GAG ACAGCCTGT AGGC AGATGG 
GCTACAGCAG AGCTGT GGAGAT TGGCCCAGACCAGGAT CT GGATGT TGT T GAAATCACAGAAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TC AGC AT CCAGT ACGAC AAACAGC ACGT CT GT GGAGGG AGCATCCT GGACCCCC ACT GGGTCCT CACGGC A 
GCCC ACTGCT TCAGGAAAC ATACCGAT GTGT TCAACTGGAAGGTGC GGGCAGGCTC AG AC AAACTGGGCAG 
CTTC CC AT CCCTGGCT GT GGCC AAGAT CAT CATC AT T GAATT CAACCC CATGT ACCCCAAAGAC AATG AC A 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 
GAT GT C T G ACAT ACT G CT GC AGGCGT CAGT CC AGGT CATT GACAGCAC ACGGTGCAAT GCAGACGATGCGT 
ACCAGGGGGAAGT C ACCGAGAAGATGAT GT GTGC AGGC AT CC CGGAAGGGGGTGT GGACACCTGCC AGGGT 
GACAGT GGT GGGCCCCT G AT GT ACCAAT CT GACCAGT GGC AT GTGGTGGGCATCGTT AGCTGGGGCT AT GG 
CT GC GGGGGCCCGAGCACCCCAGGAGT AT ACACC AAGGT C TC AGCCT ATCTCAACTGGAT CT AC AATGTCT 
GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TT CT T GGAGCAGC AAAGGGCCT CAAT TCCT GT AAGAGACCCTC GCAGC CC AG AGGCGCCC AGAGGAAGTCA 
GC AGCCCT AGCTCGGC C ACACT TGGT GC TCCC AGCAT CCC AGGG AGAGACAC AGCCCACT GAAC AAGGT CT 
C AGGGGT ATT GCT AAGCC AAGAAGGAAC TTTCCC AC AC TACT GAAT GGAAGC AGGCTGTCT TGT AAAAGC C 
CAGATCACTGTGGGCT GG AG AGGAGAAGGAAAGGGT CT GCGCCAGC CC TGTCCGT CT T C AC CC ATCCCCAA 
GCCT ACT AGAGCAAGAAACC AG TTGT AAT AT AAAATGC AC TGCCCT ACTGT T GGT ATGACT ACC GTT ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 

AAAA 
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FIGURE 112 
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MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVS IQYDKQHVCGGS I LDPHWLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSIAVAKIIIIEFNPKYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGI VS WG YGC GGPS T PGVYTKVS AYLNW I YNVWKAE L 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 113 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 

TAGGTGTAAGCCACCGTGT CTGGCC TCT GAACAACT T T T T CAGCAAC TAAAAAAGCCACAGGAG T 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 

GGCCCAAACCT GACC T T CACT C T GGAAGGAGAACAGAGGT T TCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGGCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 

TC TGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCAT TTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
Q GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 

CTCAAT T TAAATCAT GTTC TAGTAAT TGGAGCT GT CCCCAAGACCAAAGGAGC TAGAGCTT GGT T 
CAAATGAT C TCCAAGGGCCCT TAT ACCCCAGGAGACT T TGAT T TGAAT T TGAAACCCCAAAT CCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 

T TTGGGAGGCCGAGGCGGGTAGATCACCTGAGGT CAGGAGT TCAAGACCAGCC T GGCCAACAT GG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 

CCAGCTACTCGGGAGGCTGAGACAGGAGAAT TACT TGAACCTGGGAGGTGAAGGAGGCTGAGACA 
GGAGAATCAC TT CAGCCTGAGCAACACAGCGAGACTCT GTCTCAGAAAAAATAAAAAAAGAATTA 
TGGTTATTTGTAA 
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FIGURE 114 



MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 



Signal peptide: 

amino acids 1-15 
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FIGURE lis 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACC ATGG CAA 
AGAAT CC TCCAGAGAAT TGT GAAGACTGTCACATT CT AAAT GCAGAAGCTTT TAAAT CCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GT T TT GGGGGAGCAAGCAC T TCTGGCC GGAGGTACCCAAAAAAGCCTATGACATGGAGCACAC T T 
TCTACAGCAATGGAGAGAAGAAGAAGATTTACATGGAAATTGATCCTGTGACCAGAACTGAAATA 
T TCAGAAGCGGAAATGGCAC TGATGAAACAT TGGAAGT GCACGAC T TTAAAAACGGATACACT GG 
CATCTAC TTCG TGGGTC T TCAAAAAT GTT TTAT CAAAAC T CAGAT TAAAGT GAT T CC TGAAT T T T 
CTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAATTACCACAACTTTCTTTGAACAGTCAGTG 
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GAT T TGTGATAAC GTGACCATGTAT T GGATCAATCCCACTCT AATATCAGTT TCT GAGT TACAAG 
ACT TTGAGGAGGAGGGAGAAGATCT TCACT TTCCTGCCAACGAAAAAAAAGGGAT TGAACAAAA.T 
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAAC T TCCAATAAAT GACTATAC TGAAAATG GAAT AGAAT T TGATCCCATGCTGGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGC TACTACCCATATCCATACTGC TAC CAAGGAGGAC GAGT CATC TGTCGT GT CAT CAT GCCT TG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGT CTAA TAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCT CTCCAGAAT TACT TGTAGGTAAT TCCTCTCTTCATGTTCTAATAAACT TCTACA 
TTATCACCAAAAA&AAAAAAAAAAAAA 

* 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSL^ 

HT FYSNGEKKKI YME I DPVTRTE I FRS GNGTDETLEVHDFKNGYTGI YFVGLQKC FI KTQI KVT P 
EFSEPEEE IDENEE I TTT FFEQSVIWVPAEKPIENRDFLKNSKILE ICDNVTMYWINPTLI SVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 

ERGYCC I YCRRGNRYCRRVCE PLLGY YP YP YCYQGGRVI CRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Mi crobodies C- terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature, 
amino acids 9-14 
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FIGURE 117 



GAGCTCCCCT CAGGAGCGCGTT AGCT TC AC ACCT TC GGCAGCAGGAGGGCGGG AGCTT CT CGC AGGCGGCA 
GGGCGGGCGGCC AGGATCATGTCC ACC ACC AC AT GCCAAGTGGT GGCGT T CCTCCTGT CC AT CCTGGGGCT 
GGCC GGCTGC AT CGCGGC C ACCGGGATGGACATGTGGAGC AC CC AGGACCTGTACGAC AACCCCGTCACC T 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CC AAAG CC AACAT GACAC TGACCTCCGGGAT CAT GT TC ATT GTC TC AGGT CT TTGTGC AATT GCTGGAGTG 
TCTGTGTTT GCC AACATG CT GGTGACTAACT T CT GGATGTCC AC AGCT AACATGT ACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACAC TAAT TGGGGGTGTGATGATGTGCAT CGCCT GC CGGG GCCT GGCACC AGAAGAAACCAACT AC AAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 



igj CTTCCAAGCACGACTATGTG TAAT GCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 

CC CAAAAAACAAGGAGAT CCCAT CTAGATT T CTT CT T GCT TTTGACTC ACAGCTGGAAGTTAGAAAAGCCT 



W CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 

»• _ 



GCTGAGTTAT TT AT GAAT TAGAGGCT AT AGCT CACATT TT CAAT CC TCT ATTTCT TTT TTT AAAT AT AACT 
TT CT ACTCTGATGAGAGAAT GT GGTT TT AATCTCTC TCTCAC ATT TT GAT GATTT AGACAGACT CC CCCT C 
TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
i ACACTT ACTG AAGAAGAAGC AATAAGAGAAAGATATTTGTAATC TC TCCAGC C CAT GATCT CGGTT TT CTT 



ACACTGTGAT CTT AAAAGTT AC CAAACCAAAGTC ATTTTCAGTT T GAGGCAACC AAACCTTTCT AC TGCTG 
m TT GAC AT CTT CT T ATT AC AGCAAC ACCATT CT AGGAGT TT CCTGAGCTCT CC ACT GGAGT CCTCT T TCT GT 

yi CGCGGGT C AGAAAT TGTCCCT AGAT G AATG AG AAAATT AT T T TTTTT AAT TT AAGTCCT AAAT AT AGT T AA 



AAT AAAT AAT GT T TT AGT AAAATGAT AC ACT ATC TCTGTG AAAT AGCCTC ACCCCT ACATGTGG AT AGAAG 
GAAATGAAAAAAT AATTGCT TT GACATTGTCTAT ATGGTACT TTGTAAAG TCATGCTT AAGT ACAAAT TCC 
ATGAAAAGCTCACACCTGTAATCCTAGCACTTTGGGAGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGT 
TCGAGACT AGCC T GGGCAACAT GGAG AAGCCCTGTCT CT AC AAAAT AC AG AG AGAAAAAATC AGCC AGT CA 
TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT 
T GGGGCT GCAGT GAGCC ATG AT C AC ACC ACT GCACT CCAGCC AGGTGACAT AGCGAGATCCTGT CT AAAAA 
AAT AAAAAAT AAAT AAT GGAAC AC AGCAAGT CCT AGGAAGT AGGTT AAAACTAATTCTTT AA 
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FIGURE 118 

MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPl^LQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 

CAI AGVS VFANMLVTNFWMS T ANMYT GMGGMVQT VQTRYT FGAALFVGWVAGGL T L I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 

V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE 119 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACT CGATACCTTCCCATCGCACAACCCAAAAAAGT TATCACACCGGAAAGAAGTCACCGAGCG 
13 TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 

ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AATCTTAATTACAGGAACTGTGCATCAGCTATTTATGATTCTATAAGCTATTTCAGCAGAATGAGATA 
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«a TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 

11 f 

CTCT TTTTATCAT TTACTTCAAAATGACATT GCTAAAGACTGCAT TATTTTACTACT GTAATT TCTCC 
ACGACATAGCATTATGTACATAGATGAGTGTAACATTTATATCTCACATAGAGACATGCTTATATGGT 
T TTATTTAAAATGAAATGCCAGTCCAT TACACTGAATAAATAGAACTCAACTATTGC TT TT CAGGGAA 
ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
TTTATAAT GAAGATTAAAATGAAGGCTT TAATCAGCATTGTAAAGGAAATTGAATGGCT TT CT GATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
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IBs CTTGT GTAT TAAATTAACATTTTTAAAACGCAGATAT TTT GTCA&GGGGCTT TGCAT TCAAACT GCTT 

TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATAT T TTT GT TTT TGTATTTGAAGAAGAATGATGCATTTT GACAAGAAATCATATATGTAT GGAT 
ATATTTTAATAAG TAT T TGAG TACAGAC T TTGAGG T TT C ATCAAT AT AAAT AAAAGAGCAGAAAAAT A 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT AC AT T T AT AT T AAT AAAT T G T AC AT T T T T C T AAT T 
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FIGURE 12Q 

MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRMQCK 
I YDSLLALSPDLQAARGLMCAASVMS FLAFMMAILGMKCTRCTGDNEKVKAHI LLTAGI I FI I TG 
MWL I PVSWVANAI IRDFYNS I VNVAQKRELGEALYLGWTTALVL I VGGALFCCVFCCNEKSS S Y 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTC.CGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 

CTCCGGCAGAGGGAGGT GGTGGACC T GTATAAT GGAATGT GCT TACAAGGGCCAGCAGGAGT GCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 

GAT TC AAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCT T TGAGGAGTCC TGGACACCCAAC TAC 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 

TACAAAGATGCGT T CAAAT AGTGC TC TAAGAGT TT TGTT CAGTGGCT CAC T TCGGCTAAAAT GCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 

AT TGAAGC TATAATTTAT TTGGACCAAGGAAGCCCTGAAATGAAT TCAACAAT TAATAT TCAT CG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 

TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
AT TAT TGAAGAACT ACCAAAATAAATGCT T TAAT T T T CAT T T GC TACC TCTTTTTT TAT TAT GCC 
T T GG AAT GG T T CAC T T AAAT G ACAT T T TAAAT AAG T T TAT G T AT ACAT C T GAAT G AAAAG C AAAG 
CTAAATATGTTTACAGAC.CAAAGTGTGATT TCACACTGTT TTTAAATCTAGCATTATTCAT TT TG 
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATC T T TG TACAAT T T GTAAAT GTTAAGAAT TT T T T T TATATC TGT 
TAAATAAAAATTATT TCCAACA 
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FIGURE 122 

MRPQGPAAS PQRLRGLLLLLLLQL PAPS SASE I PKGKQKAQLRQRE WDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 

MRSNS ALRVLFS GSLRLKCRNACCQRWYFT FNGAECSGPLP IEAI I YLDQGS PEMNS T INI HRT S 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRI I IEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 

TAT TACC TAGGAAGAT T T T GATGT T T T GC TGCGAATGCGGT GT T GGGAT T TAT T T GT T CT TGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 

TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCT TTGAACAATACAAAGGATGGGT TTCAATG 

TAAT TAGGCTACTGAGCGGAT CAGC TGTAGCACT GG T T ATAGCCCCCACT GT CT TACT GACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 

ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ATAACAGCC T T CAAAAACT TAAGTATAAT CAAT TT AAAGGGCT CAACCAGC T CACCTGGC TATAC 
CT TGACCATAACCAT ATCAGCAATAT T GACGAAAATGC T T T TAATGGAATACGCAGACTCAAAGA 
GC T GATTC T TAGT T CCAATAGAATC TCCTAT T T T CT TAACAATACCT TCAGACCT GTGACAAATT 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
H CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
Q AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
#** TCTT TGCTGGCAT GATCAGACTCAAAGAACT TCACCTGGAGCACAATCAATT TTCCAAGCTCAAC 

SJ CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
fl CATAGGACAGACCATGTCCTGGACCTGGAGCTCCT TACAAAGGCTTGAT TTATCAGGCAATGAGA 

¥i TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
|fl TCCAACAAGCTCACAT TTATTGGTCAAGAGATTT TGGATTCT TGGATATCCCTCAATGACATCAG 

m TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
rt T TAAAGGT C TAAGGGAGAATACAATTAT C T GT GCCAGT CCCAAAGAGC TGCAAGGAGTAAAT GT G 

AT CGAT GCAGT GAAGAACT ACAGCAT C T GT GGCAAAAGT AC TACAGAGAGGT T TGAT CT GGCCAG 
* GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
Q TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
IB TTCCATAT^AATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
H CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 

GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGACTCCCAGCACCCAGGAAT T T TAT GTA 
GAT TATAAACCCACCAACACGGAGACCAGCGAGAT GCT GCT GAATGGGACGGGACCCTGCACCTA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCAT TGTGATAAAAAGAGCTCT TAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 

- GGT CAT T T TCCT C TCATACATAAT CAACCCAT T GAAAT T T AAATACCACAATCAAT GT GAAGCT T 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 

G T T T T AAGAT AAAAC T T CT T T CAT AGGT AAAAAAAAAAA 



1: 



!i 3 * 
as* 'i< 



APP ID=1 0063587 



Page 273 of 3 1 8 



FIGURE 124 

MGFOTIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKfmCESQKLQEIPSSISAGCLG 
LSLRYNS LQKLKYNQFKGLNQLTWLYLDHNH I SNI DENAFNG IRRLKEL ILS SNRI S YFLNNT FR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMIRLKELHLEHNQFSKLNLA^ 

SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKSFKGLRENTI ICASPKELQGVWIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATE PGPE TDADAEHI S FHKI IAGS VAL FLSVLVILLVI YVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 

CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 

AGCAACCT TT CCCCT GGATCT CACAAAAAC TCGACT CCAAATGCAAGGAGAAGCAGC T CTT GCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 

ATT GAAGAGGAAGGCTT TCTAAAGC T T TGGCAAGGAGTGACACCCGCCAT T TACAGACACGT AGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
TTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG 

AAAACCATT GC GAT T TCGT GGT GTACATC AT GCAT T T GCAAAAAT C T TAGC TGAAGGAGGAATAC 
GAGGGCT T TGGGCAGGCT GGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGAT T TA 

ACCACTTATGATACAGTGAAACACTACTTGGTATTGAATACACCACTTGAGGACAATATCATGAC 

TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 

TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACT TT TGTATAAATCATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 

AT CT T GGC TGAGAAT GACCCCT T GGTCAAT GGTGTTCT GGCT TACT TATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 



APP ID=10063587 



Page 275 



FIGURE 126 

MSVPEEEERLLPLTQRWPRA.SKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VI GGMMAGVIGQFLANPT DLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKI LAEGGI RGLWAGWP 
NIQRAALVNMGDLTTYDTWHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 

CAAAT GAAGAGGACAT CCTCTCCAT GT TCT CAGCTAT CCGTT CT CAGCACAGCGGTGT AGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 

GGACATGT TCAAT GT GAACGTGC TGGCCC T CAGCAT C TGCACACGGGAAGCCTACCAGT CCAT GA 
AGGAGCGGAATGTGGACGATGGGCACATCAT TAACAT CAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
Ml ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 

CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC 
II TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
W CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
^ TCATCTTG TCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
m TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
Ut TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
CI TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 

v. s 

W AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
W AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 

CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 

AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRLALVTGASGGIGAAVARALVQQG^ 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGI^POT^ 

EAYQSMKERNVDDGHI ININSMS GHRVLPLS VTHFYSATKYAVTALTEGLRQELREAQTHI RATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 

CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 

ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 

CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 

GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 

CCGT T GAGTGC TT GAAATGAGGAAC T GAGAAAAT TAATT TCTCATGTAT TT TTCT CAT T TAT T T A 

T T AAT T T T T AAC T GAT AG T T G T ACAT AT T T GG GGG T AC AT G T GAT AT T T GGAT AC AT G T AT AC AA 

TATATAAT GAT CAAAT CAGGGT AAC T GGGAT AT C CAT CAC AT C AAACAT T TAT T T T T TAT T C T T T 

T TAGACAGAGT CTCAC TCT GTCACCCAGGC TGGAGTGCAGT GGTGCCAT CTCAGCTT ACTGCAAC 

CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 

GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 

CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 

GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 

ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 

ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 

AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 

AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 

T TCCATCCATGT T GCT GCAAATGACAGGAT T TCGT TCT TAAT TTCAAT TAAAATAACCACACATG 

GCAAAAA 



FIGURE 13Q 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 



Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 



!l .- 



Zinc carboxypeptidases , zinc-binding region 2 signature 



*M amino acids 68-79 
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FIGURE 15U 



TT CT GAAGT AAC GGAAGCT ACCT T GT AT AAAGACCT CAACACTGCTGACCATG ATC AGCGCAGCCT GG AGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TC CATCTGTGTGTCGCTGCGAT GCGGGT TTCATT TACT GT AATGATCGCT TT CT GACATCCATT CCAACAG 
GAAT AC CAGAGGATGCT ACAAC TCTCT ACC T TCAGAAC AACC AAAT AAAT AATGCT GGGATT CCTT CAGAT 
TT GAAAAACT TGCTGAAAGT AG AAAGAAT AT AC CTATACCAC AACAGT TT AG AT GAAT TT CC TAC C AACCT 
CCCAAAGT AT GT AAAAGAGT T AC ATT TGC AAG AAAAT AAC AT AAGGACTAT C ACTT ATGATT CACT TT CAA 
AAATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GCCC AGGACT AT AGAAGAACT ACGCTT GGATG AT AATCGC AT AT CC AC TATTT C AT CACC ATCTCT TCAAG 
GTCT CACT AGTCTAAAACGCCT GGTTCT AG ATGG AAACCT GT TGAACAAT CATGGT TT AGGT GACAAAGT T 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 

TCCAGGCACAAACCT G AGGAAGCT TT AT CT TC AAGAT AACCACAT C AATCGGGT GCCCCC AAAT GCT T T TT 



^ AAGGC AGCT CT AT CGACT GGATATGTCCAATAAT AACCT AAGTAATT TACCTCAGGGT ATCTT T 



M CTTATCT. 



w GATG 



ii 5 5 



AT T T GGACAAT AT AAC AC AACTGATT CT TCGC AACAAT CCCT GGTATTGCGGGT GCAAGAT GAAAT G 
GGT ACGTG ACT G GTTACAATC ACT ACCTGT GAAGGT CAACGT GCGT GGGCTCAT GTGCC AAGCCCC AGAAA 
AGGT TCGT GGGAT GGCT ATT AAGGATCT CAATGCAGAACT GT T T GATTGT AAGGAC AGTGGGAT TGTAAGC 
AC CATT CAGAT AACCACT GC AAT ACCCAAC AC AGTGTATCCT GCCCAAGGAC AGTGGCCAGCT C CAGTG AC 
CAAACAGC CAGAT ATT AAGAAC CCCAAG CT CACT AAGGAT CAACAAACC ACAGGGAGT CCCTC AAGAAAAA 
CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 

ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 

PI AACAGGGGAACGCAGT GAGT AC TTGGTC AC AGCC CT GGAGCC TGAT TC ACCCTAT AAAGT ATGCAT GGT T C 

TGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 

CGAAT GT ACAACCC T ACAACCACC CTC AAT CGAGAGCAAGAGAAAG AACCT TAC AAAAACCC CAAT T TAC C 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT CAT AGG AAT GGATCGCT C TTCTCAAGGAACTGTGCAT AT AGC AAAGGGAGGAGAAG AAAGGATGACT AT 
GCAGAAGCTGGCACTAAGAAGGACAACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTT AAACCT AAGGGAGGTGATG 

GT 



W CCA' 

- J 3 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAG I PSDLKNLLKVERI YLYHNSLDEFPTNLPKYVKELHLQENNIRT I TYDSLSKI PYL 
EELHLDDNS VSAVS I EEGAFRDSNYLRLL FLSRNHLST IPWGLPRT IEELRLDDNRI S T I SS PSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 

KVNVRGLMCQAPEKVRGMAI KDLNAELFDCKDSG I VS T I QI T TAI PNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAIIGGAVALVTIALIJ^VCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
I LE IRE T S FQMLP I SNE P I SKE E FVI HT I FP PNGMNLYKNNHSES S SNRS YRDS G I PDS DHS HS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site* 

amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 

CCGTCATCCCCGTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 

TGCAGAGGCAGT CT GGGC T TGGCCAGAGC T CAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACT CAT TCT CAGAGGAGAGAAGT GACGT GTGCCT GGTGCAGC TGCT GGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCT GTCCCACCAACTGC TCT T C T T CCT CT GGGCCAGAATGAGGGGAT GCACACAGG 
GACCAC TCCAACAGAGCCAGGAC TATATCAACCTC T T CTGCGCCAACATGAT GGAC TTGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 

AACAGCAGGAAGGAT GCTT CGGGGAGCC TGATGC TGAAGAT GAAGAAT TATC TAAAGCTATT CAA 
TATCAGCAGCATTTTTCGAGGAGAGTGAAGAGGCGAGAAAAACAATTTCCAGATTCTCGCTCTGT 

TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GT GGC T TCC TAT ACAT CCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 



MSARGRWEGGGRRACRGS LGLARAQGAERVTS SE QRPAMASLGLLLLLLLTALPPLWS S S LPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein : 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE las 



GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGT TT T GTGGTT CATGATATACCT T CT GGAT CTTATGTAGTGGAAGT TGTAT CT CC 
AGCTTACAGAT TTGATCCCGTTCGAGTGGATATCACT TCGAAAGGAAAAATGAGAGCAAGATATG 
T GAATTACATCAAAACATCAGAGGT TGTCAGAC TGCCCT AT CCTC TCCAAATGAAAT CT T CAGGT 
CCACC T TC T TAC T T TATTAAAAGGGAATCGTGGGGC T GGACAGAC TT TC TAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 

C TGACATGAGACGGGAAAT GGAGCAGTCAATGAATAT GCT GAAT T CCAACCATGAGTTGCCT GAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGT^AGCAACTACTATAAACTT 

GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGT CAAT AT TGAT GT CAC T GAAT TAAT TACAGT GT CCTATAGAAAATGCCAT T AAT AAAT TAT 
AT GAAC TACTATACAT TATGTATAT TAAT TAAAACAT C T TAATCCAGAAATCAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 



FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVG FLKTDGS FWHDI PS GS YWEWS PAYRFDPVRVDI T SKGKMRARYVNYIKTSE 
VWLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMMMVLPLLIF^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

* 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 127 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 

CACCAACT CGCACTCAGACTC T GAAC TCAGACC T GAAAT CTTC TC T TCACGGGAGGCT T GGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGC T GT GTGAT CGCCACAAACCT TCAGGAAAT ACGAAATGGAT T T TC TGAGATACGGGGCAGTG 
TGCAAGCCAAAGATGGAAACATTGACATCAGAATCTTAAGGAGGACTGAGTCTTTGCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
AAGAAATACAGCCAGAT T CT GAGTCACT TT GAAAAGC TGGAACCTCAGGCAGCAGTTGT GAAGGC 
T T TGGGGGAAC T AGAC AT TCTT C T GCAATGGAT GGAGGAGACAGAATAGGAGGAAAGT GAT GCT G 
C T GCT AAGAAT AT T CGAGGT CAAGAGC T CCAGT C T T CAAT ACCTGCAGAGGAGGCAT GACCCC AA 
ACCACCATCTCT TTACTGT ACTAGTCT TGTGCT GGTCACAGTGTATCT TAT TTAT GCAT TACT TG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 

T G T AAT AT CTTTCTGCTATTG GAT AT AT T TAT TAG T T AAT AT AT T TAT T TAT TTTTTGCTATTTA 
ATGTATTTATTTT TTTACTTGGACATGAAACTTTAAAAAAAT TCACAGATTATATT TATAACCTG 
ACTAGAGCAGGTGATGTAT TTTTATACAGTAAAAAAAAAAAACCT TGTAAAT TCTAGAAGAGTGG 
CTAGGGGGGT TATTCAT TT GTATTCAACTAAGGACATAT T T AC TCATGC TGATGC TCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 

AAAAAAAAAAA 



APP ID=1 0063587 



Page 287 of 318 



i 



in 



•«[**■; J* 

?. ? v 

u *«.» 

a* * 

■? * - 

■* *3l 



FIGURE 15*8 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSIAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEET 



Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 



N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 



APP ID=10063587 



Page 288 of 3 18 



FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCT C TCT T CAG TAAAGT TGT TAT T GT TCT GATAGAT GCCT TGAGAGATGAT TT TGTGT T TG 
GGTCAAAGGGT GTGAAAT TT AT GCCCTACACAACT TACCT T GTGGAAAAAGGAGCAT CT CAGAGT 
T T T GTGGCT GAAGCAAAGCCACC TACAGT TACTAT GCCTCGAAT CAAGGCAT TGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 

TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGT CT TTTAT GGAGATGAAACCT GGGT TAAATTA 

T TCCCAAAGCAT TT T GT GGAATATGAT GGAACAACCT CAT T T T TCGTG TCAGAT TACACAGAGGT 

GGATAATAAT GT CACGAGGCAT T TGGATAAAGTAT TAAAAAGAGGAGAT TGGGACAT AT TAAT CC 

TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 

AAGC T GAGCGAGAT GGACAGCGT GCT GATGAAGATCCACACCTCACTGCAGT CGAAGGAGAGAGA 

GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 

GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 

CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 

ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 

CAATGAGAGAGCAGT T GAGATT T T TACAT T TGAAT ACAGT GCAGC TT AGTAAACT GT T GCAAGAG 

AATGT GCCGTCAT AT GAAAAAGATCC T GGGT T T GAGCAGT T T AAAAT GT CAGAAAGATT GCAT GG 

GAAC TGGATCAGACT GTAC T TGGAGGAAAAGCATT CAGAAGT CCTAT TCAACCT GGGCT CCAAGG 

TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 

TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 

CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 

CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 

GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 

AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 

TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 

CAAGG T GAT T GT AAAGAGCT GGCGGTCACAGAGGAACAAGCCCCCCAGC TGAGGGGGT GT GT GAA 

T CGGACAGCC TCCCAGCAGAGGT GTGGGAGCTGCAGC T GAGGGAAGAAGAGACAAT CGGCCTGGA 

CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 

GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 

TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 

AC TCAGAT CCACAGAGCCCAGGAT CAAGGGACCCACT GCAGT GGCAGCAGGACTGTTGGGCCCCC 

ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 

CTGACCGAGACACTCACAGCT TTGTCATCAGGGCACAGGCTT CCTCGGAGCCAGGATGAT CTGTG 

CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 

C TGCACACAGTAT GTAGT TACCAAAAGAATAAACGGCAATAAT TGAGAAAAAAAA 



FIGURE 14Q 

MRLGS GTFATCCVAI EVLG I AVFLRG FFPAPVRS SARAEHGAEPPAPEP SAGAS SNWT TLPP PLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGM 

VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGAS STEEVNTPLILIS SAFERKPGDI RHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-7 6 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 

GAAAGT TAC TAGTGCCCTAAAGC TGGCGC TGGCACT GATGT TAC TGC T GCTGTT GGAGTACAAC T 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 

CCGAAGAAATT CAGCAT TCATGACCAGGAT CACAAAGTACTGGT CCTGGAC TCT GGGAATCT CAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 

T GTGACAAGGAT AAAGGACAAAGTCATCCATCCC T TCAGC T GAAGAAGGAGAAACT GAT GAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 

ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 

GTTGGGGTGACAGATAAATTTGAGAACAGGAAACACATTGAATTTTCATTTCAACCAGTTTGCAA 

AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 

ATT TGAAC TAAT T GTATAAAAACACCAAACCTGCT CACT 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 

FFALAS S L S SAS AEKGS P I LLGVSKGE FCL YCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE ld3 



C TAGAGAGTATAGGGCAGAAGGATGGCAGATGAGT GACT CCACATCCAGAGC TGC CT CCCT T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 

CAGT GGAGT T TGCT GT CCACACATT CAACCAACAGAGCAAGGAC TAC TATGCCTACAGACT GGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 

GGGGAGAAC TAGGTGT GGGAAATTTGAAGACGACAT T GACAACTGCCAT T TCCAAGAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCT TGGAGGGATTCCACTGAGT GAAACCCACTCACAGGCTTGT CCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATCT T GAGCCT AATCATGTAGTGTAGATCAT TAAACAT CAGCAT T T TAAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAA^AAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHlSrVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGH I LNS WKE QVESKTVFSME LLLGRTRCGKFE DDI DNCHFQE S TELNNT FTC FFT I S TRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 



CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 

GACGAGGGGCAT CAAGCACAGAATCAAGT GGAAC CGGAAGGCCCT GCCCAGCACT GCCCAGAT CA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 

CCGTCACT CCAT TCT CAGCCCC TAGCAGAGCGT CTGGCACACTAGAT TAGTAGT AAAT GCT T GAT 
GAGAAGAACACATCAGGCACTGCGCCACCT GCT T CACAGTACT TCCCAACAAC T C T T AGAGGT AG 
GTGTAT TCCCGT T TTACAGAT AAGGAAAC T GAGGCCCAGAGAGC T GAAGTAC TGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 

GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
C TCTAATGAAAT TGTGAAAGCTCCATGTTTAGAAATAAATGAAAACACCTGA 
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FIGURE 146 



MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 

VLWRLVQELC S LKHCE FWLERGAGLRVTMHQPVLLCLLAL I WLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 




N-glycosylation sites. 

amino acids 98-102, 110-114 




Tyrosine kinase phosphorylation site. 

amino acids 7 6-83 




N-myr i s toyla tion s i tes . 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 




Amidation site. 



amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACT TT CAGAT TCAGGGGGTACATGTGAAGGT T T G T T TT ATGAGTATATTGCATGAT GCT GAGG 
TTTGGGGT 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 



i! S 
:; 2 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
AGAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTGCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA 

GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAA^ 

T GTT GATAACT TGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATT GAGAAAGAAAGACAAT CTA 
T AAG AAG CT C C C C AC T T GAT AAT AAGT T GAAT GT G GAAG AT GT T GAT T CAAC C AAGAAT C G AAAACT G 
ATCGATGATTATGACT CTACT AAGAGT GGAT TGGATCATAAATTTCAAGATGATCCAGATGGTCT TCA 

TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATT GTCCATAAAATCGCT GCCAGGATT TAT GAAG 
AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAACAA 

T TAT GAGGAGGATCCCAAT AAGCCCACAAGCTGGACT GAGAAT CAGGCT GGAAAAATAC CAGAGAAAG 
TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
TTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTTGAGGAACTCCA 
ATAT TTCCCAAAT TTCTATGCGCTACTGAAAAGTATT GAT T CAGAAAAAGAAGCAAAAGAGAAAGAAA 

C AC T GAT T AC TAT CAT GAAAACAC T GAT T GACT T T GT GAAGAT GATGG T GAAATAT G GAACAAT AT C T 
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA 

GATGATAACTC CAACCCAGGAGGAAAGACAGAT GAACCCAAAGGAAAAACAGAAGC CTATTTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATT T ATAGCAGCCT GTAAAAATGGCAAAAGATCCAGGAGT CT T TCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE l*Q 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 

RRTKT YSEDNFEELQYFPNFYALLKS IDS EKEAKEKE TL I T IMKTL I DFVKMMVKYGT I S PEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 

EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 

CGGC TCGAGGC TCCCGCCAGGAGAAAGGAACAT T CT GAGGGGAGTCTACACCCT GT GGAGCT CAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 

ATAATAACCAGC T TCTAGC T GGAGGGCTGCAT GCAGGGAAGGTCAT TAAAGGTGAAGAGAT CAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 

CCAGTGCC TGTCAT GTGGGGT GGGGCAGGAGCCGACT CTAACAC TAGAGCCAGT GAACAT CATGG 
AGCTGTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 

GAGTGGAGGAGACCCATGGCGGACAATCAC TCTCTCT GCT C TCAGGACCCCCACGTC T GAC T TAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
GAAAT GGC TCGAGCTCAGAAGATAAAAGATAAGTAGGGT AT GC T GAT CC TC T TT TAAAAACCCAA 
GATACAAT CAAAAT CCCAGAT GC TGGT CTC TAT T CCCAT GAAAAAGTGC TCATGACATATT GAGA 
AGACC TACT TACAAAGT GGCATATAT TGCAAT T TAT T T TAAT TAAAAGATACC TATTTAT ATATT 
TCTT TATAGAAAAAAGTCT GGAAGAGT TTACTT CAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
AT TCCTTGTATAAAAATAAGAAAAGAAAT TAATCTTGAGGTAAGCAGAGCAGACATCATCTCTGA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
T TGTAGTAGTGATCAGGAAACAGAT CT CAGCAAAGCCACTGAGGAGGAGGC T GT GCTGAGT T TGT 

GTGGCTGGAAT CT CT GGG TAAGGAAC T TAAAGAACAAAAAT CATC TGGTAAT TCT T TCCT AGAAG 
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
G T CTC TGCAGAT GTAGTT AGT TAAGACAAGGTCATGCT GGAT GAAGGTAGACCTAAAT T CAATAT 
GACTGGT T TCCT TGTATGA&AAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTAT GTA 
AAGATGAAGGCAGAGATCGGAGT TT TGCAGCCACAAGCTAAGAAACACCAAGGATT GTGGCAACC 
ATCAGAAGCTTGGAAGAGGCAAAGAAGAAT TCT TCCCTAGAGGCT TTAGAGGGATAACGGCTCTG 
C TGAAACC T TAAT CT CAGAC T TCCAGCC TCC T GAACGAAGAAAGAATAAAT T TCGGC T GT TT T AA 
GCCACCAAGGATAAT TGGT TACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGTTATAAAAGACACTGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGC TAGC TGCCATGCT AT GAGCAGGCC TATAAAGAGACTTACGTGGTAAAAAAT GAAGT CT CCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGT GAGATAATCGATGT TTGTTGT T TT 
AAGT TGCTCAGTT TTGGTCTAACT TGT TATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 

MVLSGALCFI^DSALKVLYLHNNQ^ 

QCLSCGVGQEPTLTLEPWIMELYLGAKESKS FT FYRRDMGLTS S FESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myxistoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 



HI 



w 



Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 



IKS .j 

3 IS? 
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FIGURE 153 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 

CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 

CCC TC T T GGTACAGGGAGGAGCAGC T GCGC CCATCAGCT C CCAC T GCAGGC T TGACAAGT CCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 

CAACACAGACGT TCGTCT CATTGGGGAGAAACTGTT CCACGGAGTCAGTAT GAGT GAGCGCT GCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT TGAAGGTGAT GACCT GCATATCCAGAGGAAT GT GCAAAAGC TGAAGGACACAGTGAAAAAGC 
T TGGAGAGAGT GGAGAGAT CAAAGCAAT T GGAGAACT GGAT TTGCTGT T TATGT C TCT GAGAAAT 
GCCTGCAT T TGACCAGAGCAAAGC TGAAAAATGAATAACTAACCCCC T T TCCCTGC TAGAAATAA 
CAAT TAGATGCCCCAAAGCGATTT TT TTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGAT TCCAAAT GAACCCC TGCGT TAGT T ACAAAGGAAACCAAT GCCACT T TTGT TTAT A 
AGACCAGAAGGTAGAC T T TCTAAGCATAGATATT TATT GATAACATT TCATTGT AACT GGTGT T C 
T AT AC AC AGAAAAC AAT T TAT T T T T T AAAT AAT TGTCTTTTTC C ATAAAAAAGAT T AC T T T C CAT 
T CC T T TAGGGGAAAAAACCCCTAAATAGC T TCAT GT T T CCATAAT CAGTACT T TAT AT T TATAAA 
TGTAT T TAT TAT TATT ATAAGAC TGCAT T T TAT T T ATATCATT T TAT TAATAT GGAT TTAT T TAT 
AGAAACATCATT CGATATT GCTAC T TGAGT GTAAGGCTAAT AT TGATAT TTAT GACAAT AAT TAT 
AGAGCTATAACATGT TTATT TGACCTCAATAAACACT TGGATATCCC 



10063587 



Page 303 



m 



FIGURE iZd 

MAALQKS VS S FLMGTLAT SCLLLLALLVQGGAAAPI S SHCRLDKSNFQQP Y I TNRT FMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 

TCH IEGDDLH I QRNVQKLKDTVKKLGE S GE IKAI GELDLL FMSLRNAC I 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites . 

amino acids 54-58, 68-72, 97-101 



0§ 

W N-myristoylation sites 



y '* ■ 

m 
mi 

W amino acids 10-21 

H 

h > 
r& ^ 



amino acids 14-20, 82-88 



Prokaryotic membrane lipoprotein lipid attachment site 
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FIGURE 155 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 

CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 

TGCCTGAGGGCT GT GC TGAGAGGGAGAGGAGCAGAGATGCT GC TGAGGGTGGAGGGAGGCCAAGC 

TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 

GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 

TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 

CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 

GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 

CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 

TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 

GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 

AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 

GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 

ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 

CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 

CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 

GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 

GCCCAGCACAGGCACT T T CT AGATAT T TCCCCC T T GCT GGAGAAGAAAGAGCCCC T GGT T TTAT T 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 

T T GACATGGAT GAT TCTGAGGAGGAAGC TGTTAT TGAATGTATAGAGAT T TATCCAAAT AAATAT 
C T T T AT T T AAAAAT GAAAAA 



10063587 



Page 305 



FIGURE 156 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVP 

RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 

ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site, 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myr i s toyla ti on si tes . 

amino acids 44-50, 150-156 
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FIGURE 157 

CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CGGGAGAC T TGAGGGACCT CCGAGTAGAACC T GT T ACAACTAGT GT T GCAACAGGGGACT AT TCA 
AT T T T GAT GAAT GTAAGCT GGGTAC TCCGGGCAGAT GCCAGCAT CCGCTTGT TGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 

GAGCTGAACACAGT CT AT T T CAT TGGGGCCCATAATATTCC TAATGCAAATATGAAT GAAGAT GG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 

GTAGAAGT GAACT TCACAACCACTCCGCTGGGAAACAGATACAT GGC T C T TATCCAACACAGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 
AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 

W GT GAT C TAAGAAGCCAGAT T CAT CTGCACAAATACGT GGTGGTCTACT T TAGAGAGAT TGAT ACA 

AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 

ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 

MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWI^QHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSeVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 

NANMNEDGPSMSWFTSPGCLDHIMK^ 

STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 

QN HC RS E V I L EKWQKKKI AEMG P VQWLATQKKAADKW FLLSND VN S VCDGT CGKSEGSPSENS QDL F PLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 

GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and c GMP -dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AGCCACCAGCGCAACATGACAGTGA?VGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 

AC TT GGGC TGCATCAATGC TCAAGGAAAGGAAGACAT CTCCATGAAT TCCG T TCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 

AC TCAGC TGAAGAAG 
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FIGURE 160 



mtvktlhgp™vkylllsilglaflseaaarkipkvghtffqkpescppvpggsmkldigiinen 

QRVSMSRNI E SRST S PWNYTVTWDPNRY PSEWQAQCRNLGC INAQGKEDI SMNSVP I QQETLW 
RRKHQGCSVS FQLEKVLVTVGCTCVT PVIHHVQ 

Si gnal sequence : 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 

ACAC TGGCCAAACAAAAACGAAAGCACT CCGT GCT GGAAGTAGGAGGAGAGTCAGGAC T C CCAGG 
ACAGAGAGTGCACAAAC TACCCAGCACAGCCCCCT CCGCCCCC TCTGGAGGC T GAAGAGGGAT T C 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGT TTGGAGGAGCAGCT GACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
U CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
m CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
SI CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
^ GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
P- TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
Ul AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
jjf! CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
m GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
H CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGAT-GACTTGGGAGCGCTATGGGCCTG 
"1 CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
S CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
fl CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
S5 CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
I!! CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
« CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
flj GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
fl GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
Si GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
j ^ CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 

GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 

C GGGAC T TAAAT AAAG G C AGACGC T G T T T T T C T AAAAAAA 
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FIGURE162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEE PRNASLQAQVVLS FQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDBWLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 

GQCL QLWDDDLG AL WACPMDKY IH KR WAL VWL ACLL FAAAL S L I LL LKKD HAKGWL RL L KQDVRS GAAARG 
RAALLL YSADDSGFERLVGALASALCQLPLRVAVDLWSRREL SAQGPVAW FHAQRRQTLQEGGWVLL FS P 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394. 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, .319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATGA.GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCC T GACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGT CTACAGCAT CGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCT GTCAGCGGATCACCCGGAAGTCCT GC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 

CCTCAAGCCACCTGAT GTGACCTGTATCTCCAAAGTGAGATCGAT TCAGATGATT GTT CATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

T TCTACCACT T AGAGC TCCAGGT CAAC CGCACCTACCAAAT GCACCT TGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 

CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

U T CCACAGCGGCATAGC CT GT CCGAGATCACC TACT TAGGGCAGC CAGACATC TCCATCCTCCAGC 

tf *| CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

«J GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

W- GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

P. CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

W CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 

^ T TC TCT GCAGGAGGT GACC TCC T TGGCTATGGAGGAAT CCCAAGAAGCAAAATCAT T GCACCAGC 

fj C CCT GGGGAT TT GCACAGACAGAACATCTGACCCAAAT GT GCTACACAGTGGGGAGGAAGGGACA 

W CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
J CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
GAGCAGCCCACAGAACT GGAT T C T C T T T TCAGAGGCCT GGCCC TGACTGTGCAGT GGGAGTCCTG 
m AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
S» ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
W AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
Ui AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
£! CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
Rjj CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 

ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 

GC CTGGAAAAGAACCAGGC CT GGAAAAGAACCAGAAGGAGGC T GGGCAGAACCAGAACAACCT GC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 

T T CCCAGCCAGGGCAAC T GCCT GACGT T GCACGAT T TCAGCT TCAT TCC TCT GAT AGAACAAAGC 
GAAATGCAGGT CCACCAGGGAGGGAGACACACAAGCCT T TTCTGCAGGCAGGAGT T T CAGACCCT 
AT CCT GAGAAT GGGGT T T GAAAGGAAGGTGAGGGC T GT GGCCCCT GGACGGGTACAAT AACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 

AAAAAAAA 
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FIGURE 164 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 

YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 

EESQEAKSLHQPLGICTDRTSDP3WLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 

SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 



TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAT^GTCACCCGGGCCCGCGGTGGCCACi^CATGG 

CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 

AAAAAGGTGACGAT GTAT ATGTC TAC TACAAAC T GGCAGGGGGATCC C T TGAAC T T TGGGCTGGA 
AGTGTTGAACACAGTT TTGGATAT TT TCCAAAAGATT TGATCAAGGTACTTCATAAATACACGGA 

AGAAGAGC TACATAT T CCAGCAGATGAGACAGACTT T GT CT GCT TTGAAGGAGGAAGAGAT GAT T 
T TAATAGT TATAAT GTAGAAGAGCT TT TAGGAT CT T T GGAACTGGAGGACTC TGTACC TGAAGAG 
TCGAAGAAAGCTGAAGAAGT T TCTCAGCACAGAGAGAAAT CT CCT GAGGAGT CT CGGGGGCGT GA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 

CAGAGAGCACCGAGGGGCT GCAGGGACAGCCCT CAGCT CAGGAGAGCCACCC TCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT 
CATTACAGCAAAGGATTTCGTTGGCATCAAAATCTAAGTTTGTTTTACAAAGATTGTTTTTAGTA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 

AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 

MAAAPGLLFWLFVLGALWWVPGQS^ 

FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 

EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 



Tyrosine kinase phosphorylation site. 

amino acids 67-76 



N-myristoylation sites 



W amino acids 205-211, 225-231, 277-283 



Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCC T TGAGAAT GAAGCAGAACAGAAGT TAATAGAGAGCAT GTT GCAAAACC TGACAAAACCCGGG 
ACAGGGATT TC T GAT GGT GATT TC TGGATAGGGCT T T GGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
C T TGGGGGTCCCTACCTT TACCAGTGGAATGAT GACAGGT GT AACAT GAAGCACAATT AT AT T TG 
CAAGTAT GAACCAGAGAT TAATCCAACAGCCCCT GTAGAAAAGCC TTAT CT TACAAAT CAACCAG 
ji 5 , GAGACACCCAT CAGAATGT GGTT GTTAC T GAAGCAGGTATAAT TCCCAATC TAAT T TAT GT T GT T 

P ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 

W GCATAAAAGTAAAGGAAGAACAAAAAC T AGTCCAAACCAGT CTACAC T G TGGATT T CAAAGAGTA 

C| CCAGAAAAGAAAGTGGCAT GGAAGTATAATAACT CATTGACTTGGTTCCAGAATT TTGTAATT CT 

fl GGAT C TGTATAAGGAATGGCATCAGAACAAT AGCTT GGAAT GGC T TGAAATCACAAAGGATCT GC 

|'4 AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 

TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
AC TT CAAACTT CAAGCAAAT GAAATGGACAATGCAGATAAAGTTGT TAT CAACACGTCGGGAGTA 
m TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 

TGTATATTGTAT TGAAATT TACAGTGTGCAAAAGTATT TTACCT T TGCATAAGTGTT TGATAAAA 
AT GAACTGT TC T AATAT T TAT T T T TAT GGCATCT CATT TT TCAATACAT GC T C TT T T GATTAAAG 
AAAC T TATTACT GTT GT CAACTGAAT TCACACACACACAAATATAG TACCATAGAAAAAGT T T GT 
|Fg TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 

AATAAGAAGC T AT TT CAT TAAGTGTGATATAAACC TCCT CAAACAT T T TAC T T AGAGGCAAGGAT 
|2 TGTC TAAT T T CAAT T GT GCAAGACAT GT GCC TTAT AATTAT T T T TAGC T TAAAAT T AAACAGAT T 

|l| T TGTAATAATGTAACT TTGTTAATAGGTGCATAAACACTAATGCAGTCAATT TGAACAAAAGAAG 

m ' TGACATACACAATATAAAT CATAT GTC T TCACACGTT GCCTATATAAT GAGAAGCAGCTCTCTGA 

J2? GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 

M AT TGACAC TGGAGGCAGAT AGTT GCAAAGT TAGT CTAAGGT TTCCC TAGC TG TAT T TAGCCT C T G 

AC TAT AT TAGTATACAAAGAGGTCAT GT GGTTGAGACCAGGT GAATAGT CAC TAT CAGT GT GGAG 
ACAAGCACAGCACACAGACAT T T TAGGAAGGAAAGGAACT ACGAAATCGT GT GAAAATGGGT T GG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 

MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMA.YFHELSSRVS FQEARLACESE 

GGVLLSLENEAEQKL I E SMLQNLTKPGTG I S DGDFW I GLWRNGDGQTS GACPDL YQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNWVTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFC^LHKSKGRTKTSPNQ 

S TLWI SKS TRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 



Transmembrane domain: 

amino acids 214-235 



N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 



p| amino acids 2 66-2 69 



N-myristoylation sites. 



m amino acids 27-32, 66-71/ 91-96, 93-98, 102-107, 109-114, 140 

PI 145, 212-217 

w 

n 

• 1 '2 --• 
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